Abstract. We give necessary and sufficient conditions for two-weight norm inequalities for Haar multiplier operators and for square functions. We also give sufficient conditions for two-weight norm inequalities for the Hilbert transform.



0. Introduction 

Weighted norm inequalities for singular integral operators appear naturally in many areas of analysis, probability, operator theory, etc.

The one-weight case is now well understood, and the answers are given by the famous Helson-Szegő theorem and the Hunt-Muckenhoupt-Wheeden theorem. The first one states that the Hilbert transform $H$ is bounded in the weighted space $L^2(w)$ if and only if $w$ can be represented as $w = \exp\{u + Hv\}$, where $u, v \in L^\infty$ and $\|v\|_\infty < \pi/2$.

The Hunt-Muckenhoupt-Wheeden theorem states that the Hilbert transform $H$ is bounded in $L^p(w)$ if and only if the weight $w$ satisfies the so-called Muckenhoupt $A_p$ condition

($A_p$)  $\sup_I \Big( \frac{1}{|I|} \int_I w \Big) \Big( \frac{1}{|I|} \int_I w^{-1/(p-1)} \Big)^{p-1} < \infty$,

where the supremum is taken over all intervals $I$. This condition is also necessary and sufficient for the boundedness of a wide class of singular integral operators, as well as for the boundedness of the maximal operator $M$,

$Mf(x) = \sup_{I \ni x} \frac{1}{|I|} \int_I |f|$;

here the supremum is taken over all intervals $I$ containing $x$.

It is worth mentioning that there is no direct proof of the equivalence of the Helson-Szegő condition and the Muckenhoupt condition $A_2$.

Two-weight inequalities, i.e. the problem of when an operator acts from $L^2(w)$ to $L^2(v)$ (one can also consider the $L^p$ case, even with different exponents $p$, but the $L^2$ case is complicated enough, so we restrict our attention to it), appear naturally in many areas such as the theory of Hankel and Toeplitz operators, perturbation theory, etc.
Things look much more complicated in the two-weight case, and it is probably agreed now that there is no simple (Muckenhoupt type) necessary and sufficient condition for the boundedness of the Hilbert transform.

It was a big surprise when Eric Sawyer [S1] found a necessary and sufficient condition for the maximal operator $M$ to be a bounded operator from $L^2(w)$ to $L^2(v)$: his theorem states that it is enough to test the boundedness on a very special class of test functions, namely only on functions $\chi_I w^{-1}$ (and we should do the same for the adjoint operator).* We will call conditions of this type Sawyer type conditions.

There is also a two-weight analog of the Helson-Szegő theorem due to M. Cotlar and C. Sadosky, see [CS1]. Their approach (which can be referred to as the Generalized Bochner Theorem) provides both integral representation and extension of forms and kernels invariant under the shift operator. Applied to a special bilinear form built with the help of the Hilbert transform and two measures, this approach gives a necessary and sufficient condition for the Hilbert transform to be bounded between $L^2$-spaces with respect to these measures (see [CS1]). The approach of M. Cotlar and C. Sadosky is very interesting because it provides a direct link between the lifting theory of Sz.-Nagy and Foias (and thus the scattering theory) and the continuity of the Hilbert
transform in weighted spaces (see 0). 

But there is no analog of the Muckenhoupt $A_2$ condition for two weights that is necessary and sufficient for the boundedness of the Hilbert transform. There are quite a few sufficient conditions; let us mention a very nice and simple one due to Dechao Zheng [Z].



In this paper we are going to consider an operator (more precisely, a family of operators, the so-called Haar multipliers) which can serve as a good "model" for singular integral operators. For such operators we give necessary and sufficient conditions (of Sawyer type) for the boundedness.

Our operators appear to be simpler than the Hilbert transform, and we believe that our approach splits the difficulties of two-weight singular integral estimates and allows us to treat these difficulties separately.

So let us explain what our "model" operator is. Let $\mathcal{D}$ denote the set of dyadic subintervals of the real line $\mathbb{R}$. Let $\sigma = (\sigma_I)_{I \in \mathcal{D}}$ be a sequence of signs $\pm 1$. We will actually be dealing with the following family of operators. Let $I_-$, $I_+$ denote the left and the right halves of a dyadic interval $I$ and let

$h_I(x) = +|I|^{-1/2}$ for $x \in I_-$,  $h_I(x) = -|I|^{-1/2}$ for $x \in I_+$,  $h_I(x) = 0$ for $x \notin I$,

denote a Haar function normalized in $L^2 = L^2(\mathbb{R})$. Let $(\cdot, \cdot)$ denote the scalar product in $L^2$. We are interested in the following question: how to describe the pairs $(\mu, \nu)$ of weights on $\mathbb{R}$ such that all operators $T_\sigma$,



* Sawyer's theorem states more, and treats the $L^p$ case as well, but we are not going into details here.
(0.1)  $T_\sigma f = \sum_{I \in \mathcal{D}} \sigma_I (f, h_I) h_I$

are uniformly bounded from $L^2(\mu)$ to $L^2(\nu)$ with respect to all possible choices of $\sigma$?
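The operators in (0.1) can be tried out on a finite grid. The following is only a minimal sketch under stated assumptions, not the construction used in the paper: functions on $[0,1)$ are taken to be constant on $2^m$ equal cells, the sign convention for $h_I$ (positive on the left half) is an assumption, and the names `haar_system`, `inner` and `haar_multiplier` are hypothetical helpers introduced for illustration.

```python
import random


def haar_system(levels):
    """Haar functions on [0, 1), sampled on a grid of 2**levels equal cells.

    Assumed convention: h_I = +|I|^(-1/2) on the left half of I and
    -|I|^(-1/2) on the right half.
    """
    n = 2 ** levels
    system = []
    for level in range(levels):
        size = n >> level              # number of cells in each interval I
        half = size // 2
        amp = (size / n) ** -0.5       # |I|^(-1/2)
        for start in range(0, n, size):
            h = [0.0] * n
            for i in range(start, start + half):
                h[i] = amp
            for i in range(start + half, start + size):
                h[i] = -amp
            system.append(h)
    return system


def inner(f, g):
    """Unweighted L^2[0, 1) scalar product of two grid functions."""
    return sum(a * b for a, b in zip(f, g)) / len(f)


def haar_multiplier(f, signs, system):
    """Discrete analogue of T_sigma f = sum_I sigma_I (f, h_I) h_I."""
    out = [0.0] * len(f)
    for s, h in zip(signs, system):
        c = s * inner(f, h)
        for i, value in enumerate(h):
            out[i] += c * value
    return out
```

In the unweighted space every choice of signs gives an isometry on mean-zero functions; the whole difficulty discussed in the text appears only once the norms are taken in $L^2(\mu)$ and $L^2(\nu)$.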

It is easy to see that the measure $\nu$ has to be absolutely continuous. It is also not difficult to see that the singular part of $\mu$ does not help: if the operators $T_\sigma$ are uniformly bounded from $L^2(\mu)$ to $L^2(\nu)$, then the same holds if we replace $\mu$ by its absolutely continuous part. So, without loss of generality, one can assume that the measures $\mu$ and $\nu$ are absolutely continuous, $d\mu = u\,dt$, $d\nu = v\,dt$.

If the operators $T_\sigma$ are uniformly bounded, then the operators

$f \mapsto (f, h_I)_{L^2} h_I$,  $I \in \mathcal{D}$,

are uniformly bounded as well. For a fixed $I$ the norm of the above operator can be easily computed (it is just a rank one operator), and it is equal to $(v)_I^{1/2} (u^{-1})_I^{1/2}$, where $(v)_I$ denotes the average $|I|^{-1} \int_I v$. So we get a simple necessary condition

$\sup_{I \in \mathcal{D}} (v)_I (u^{-1})_I < \infty$,

which can be considered as a two-weight analog of the Muckenhoupt $A_2$ condition. Unfortunately this condition is not sufficient.
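The rank one norm computation above can be checked numerically. This is a small sketch under assumptions that are not in the original: the interval $I$ is normalized to $[0,1)$, the weights $u$, $v$ are positive and constant on equal cells, and `predicted_norm` and `norm_ratio` are hypothetical helper names. The extremal input is taken to be $f = h_I / u$, which is where the Cauchy-Schwarz inequality becomes an equality.

```python
import math


def predicted_norm(u, v):
    """(v)_I^(1/2) (u^(-1))_I^(1/2) for grid weights u, v on I = [0, 1)."""
    n = len(u)
    avg_v = sum(v) / n
    avg_u_inv = sum(1.0 / x for x in u) / n
    return math.sqrt(avg_v * avg_u_inv)


def norm_ratio(f, h, u, v):
    """||(f, h) h||_{L^2(v)} / ||f||_{L^2(u)} for grid functions on [0, 1)."""
    n = len(f)
    coeff = sum(a * b for a, b in zip(f, h)) / n
    out_norm = abs(coeff) * math.sqrt(sum(x * x * w for x, w in zip(h, v)) / n)
    in_norm = math.sqrt(sum(x * x * w for x, w in zip(f, u)) / n)
    return out_norm / in_norm
```

For example, with `h = [1, 1, -1, -1]`, `u = [1, 2, 4, 8]` and `v = [3, 1, 2, 2]`, the ratio at `f = [x / y for x, y in zip(h, u)]` agrees with `predicted_norm(u, v)`, and other inputs give a ratio that is no larger.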

It is convenient to denote $w := u^{-1}$. In this notation the uniform boundedness of the operators $T_\sigma : L^2(u) \to L^2(v)$ is equivalent to the uniform boundedness of the operators $M_v^{1/2} T_\sigma M_w^{1/2}$ in the usual (non-weighted) $L^2$; here $M_v$ and $M_w$ denote the operators of multiplication by $v$ and $w$ respectively.

And as we have shown above, the condition

$\sup_{I \in \mathcal{D}} (v)_I (w)_I \le C$

is necessary for the uniform boundedness of all $M_v^{1/2} T_\sigma M_w^{1/2}$ in $L^2$, or equivalently for the uniform boundedness of all operators $T_\sigma : L^2(w^{-1}) \to L^2(v)$.

The above uniform boundedness of $T_\sigma$ admits a simple geometric interpretation in terms of so-called multipliers. Consider the family $\mathcal{M}$ of bounded operators $A : L^2(w^{-1}) \to L^2(v)$ that commute with all operators $f \mapsto (f, h_I) h_I$, $I \in \mathcal{D}$. We call this family the family of Haar multipliers.

These are operators given by the simple formula $A h_I = a_I h_I$. If $a = \{a_I\}_{I \in \mathcal{D}}$ is given, let us call this operator $A_a$.

Now one can note that the uniform boundedness of $T_\sigma$ is equivalent to the inclusion

(0.2)  $\ell^\infty \subset \mathcal{M}$

in the sense that $A_a \in \mathcal{M}$ for all $a \in \ell^\infty$.

We are going to investigate the question of when (0.2) holds, that is, when the family of Haar multipliers contains $\ell^\infty$.
We are going to formulate three main results now. Notice that Theorems 0.2 and 0.3 together give necessary and sufficient conditions for

(0.3)  $\sup_\sigma \| M_v^{1/2} T_\sigma M_w^{1/2} \| < \infty$,

and Theorem 0.1 also gives necessary and sufficient conditions in a completely different form.



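Theorem 0.1. The family of singular integrals $M_v^{1/2} T_\sigma M_w^{1/2}$ is uniformly bounded in $L^2$ if and only if

1. $\forall J \in \mathcal{D}$  $\sup_\sigma \frac{1}{|J|} \int_J |T_\sigma(\chi_J w)|^2 v\,dx \le C (w)_J$;

2. $\forall J \in \mathcal{D}$  $\sup_\sigma \frac{1}{|J|} \int_J |T_\sigma(\chi_J v)|^2 w\,dx \le C (v)_J$.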



Note that either of the above conditions 1 or 2 immediately implies that

$\forall J \in \mathcal{D}$  $(v)_J (w)_J \le C < \infty$,

which is the necessary condition we discussed above.

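To formulate the next theorem, let us introduce some notation. Let us denote

$\alpha_I := \frac{|(v)_{I_-} - (v)_{I_+}|}{(v)_I} \cdot \frac{|(w)_{I_-} - (w)_{I_+}|}{(w)_I} \cdot |I|$,

where $I_-$, $I_+$ are the left and right halves of $I$. Consider an integral operator $T_0$ given by the formula

$T_0 f := \sum_{I \in \mathcal{D}} \frac{\alpha_I}{|I|} (f)_I \chi_I$,

whose kernel $k(x,y) = \sum_{I \in \mathcal{D}} |I|^{-2} \chi_I(x) \chi_I(y) \alpha_I$ is evidently positive.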



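Theorem 0.2. The family of operators $M_v^{1/2} T_\sigma M_w^{1/2}$ is uniformly bounded in $L^2$ if and only if the following four assertions hold simultaneously:

1. $\forall J \in \mathcal{D}$  $(v)_J (w)_J \le C < \infty$;

2. $\forall J \in \mathcal{D}$  $\frac{1}{|J|} \sum_{I \subset J} |(v)_{I_-} - (v)_{I_+}|^2 (w)_I |I| \le C (v)_J$;

3. $\forall J \in \mathcal{D}$  $\frac{1}{|J|} \sum_{I \subset J} |(w)_{I_-} - (w)_{I_+}|^2 (v)_I |I| \le C (w)_J$;

4. The operator $T_0$ is bounded from $L^2(w^{-1})$ to $L^2(v)$, or, equivalently, the operator $M_v^{1/2} T_0 M_w^{1/2}$ is bounded in $L^2$.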

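Theorem 0.3. The operator $T_0$ is bounded from $L^2(w^{-1})$ to $L^2(v)$ if and only if

1. $\forall J \in \mathcal{D}$  $\frac{1}{|J|} \int_J \Big( \sum_{I \subset J} \frac{\alpha_I}{|I|} \chi_I (w)_I \Big)^2 v\,dx \le C (w)_J$;

2. $\forall J \in \mathcal{D}$  $\frac{1}{|J|} \int_J \Big( \sum_{I \subset J} \frac{\alpha_I}{|I|} \chi_I (v)_I \Big)^2 w\,dx \le C (v)_J$.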

Theorem 0.1 looks surprising. If it concerned an operator with positive kernel it would be in the vein of Sawyer's weighted theorems from [S1], [S2]. In fact, exactly as in [S1], [S2], Theorem 0.1 claims that (the family of) integral operators are bounded if and only if (the family of) integral operators are bounded on test functions $\chi_J$ and the adjoint operators are bounded on test functions $\chi_J$. However, the family $T_\varepsilon$ models a singular integral operator rather than a positive kernel integral operator. Unlike the case of positive kernel integral operators, it now seems surprising that boundedness on $\chi_J$ implies boundedness on smaller positive functions.

On the other hand, the classical T1 theorem says exactly the same: if an operator with Calderón-Zygmund kernel is uniformly bounded on $\chi_J$ then it is bounded. Recently (see [NTV] and [T]) the T1 theorem was extended to nonhomogeneous spaces (spaces with non-doubling measure). It turned out that this generalization plays an important role in the treatment of analytic capacity problems, including a famous problem of Ahlfors-Vitushkin.

The method we use to prove these theorems consists of constructing the Bellman function of the problems we consider. Roughly speaking, we try to solve an extremal problem associated to a given problem (simply speaking, we try to consider the worst possible case). This leads to the Bellman function of the problem. This approach resembles the approach of Burkholder ([Bu]). But there is a difference. We are not solving the extremal problem mentioned above (we would only wish). Instead, we are looking for a kind of subsolution of an associated system of Partial Differential (In)Equalities.

Theorem 0.3 can most probably be proved using a combination of ideas from the papers of Kalton, Verbitsky [KV] and Sawyer, Wheeden [SW]. Following [KV] we can introduce the new metric $d(x,y) := \frac{1}{k(x,y)}$, where $k$ is the kernel of $T_0$ written above. It is clearly a metric, and balls in this metric are just all dyadic intervals. Then the kernel $k$ satisfies the regularity condition from [SW]. Using the main result of [SW] we could have given an alternative proof of the theorem. Unfortunately it is not clear why the new metric space has certain regularity properties. For example, in [KV] one requires the property that all annuli be nonempty. This is false in our new metric space. However, it is not clear to us how essential those regularity properties of the metric space are for the application of the technique of [SW].

Acknowledgements. We are grateful to Peter Jones, Robert Fefferman and Igor 
Verbitsky for valuable discussions of this paper. 

1. Necessary conditions 

As was shown above in the Introduction, the uniform boundedness of $M_v^{1/2} T_\sigma M_w^{1/2}$ implies that the operators $f \mapsto (f, h_I) h_I$ are uniformly bounded as operators from $L^2(w^{-1})$ to $L^2(v)$, and the latter condition is equivalent to

(1.1)  $(v)_I (w)_I \le C < \infty$.
It is well known that for the case of one weight ($w^{-1} = v$ in our notation) this condition is just the famous Muckenhoupt $A_2$ condition, and it is a sufficient condition for the uniform boundedness of $M_v^{1/2} T_\sigma M_w^{1/2}$.

But in the general case we have other simple necessary conditions which are independent of (1.1). To get one of these conditions let us apply the operator $M_v^{1/2} T_\sigma M_w^{1/2}$ to the test function $w^{1/2} \chi_J$. We get

$\int \Big| \sum_{I} \sigma_I (w \chi_J, h_I) h_I(x) \Big|^2 v(x)\,dx \le C \int_J w\,dx$.



Let us now take the average over all possible choices of signs $\sigma_I$. Fix a function $g$ in $L^2(w^{-1})$ and let $\{s_I(\omega)\}$ be a sequence of independent random variables assuming values $\pm 1$ with probabilities 1/2, 1/2. Then

$\int d\mathbb{P}(\omega) \int \Big| \sum_{I \in \mathcal{D}} s_I(\omega) (g, h_I) h_I(x) \Big|^2 v\,dx \le C \|g\|^2_{L^2(w^{-1})}$

and

$\int \sum_{I \ni x} |(g, h_I)|^2 \frac{1}{|I|} v(x)\,dx \le C \int |g|^2 w^{-1}$,

which is

$\sum_{I \in \mathcal{D}} |(g)_{I_-} - (g)_{I_+}|^2 (v)_I |I| \le C \int |g|^2 w^{-1}$.
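The averaging step uses the fact that for independent random signs the cross terms cancel, so the mean of $|\sum_I s_I c_I h_I(x)|^2$ equals $\sum_I |c_I|^2 h_I(x)^2$. A minimal sketch of this identity at a single point $x$, for finitely many terms, with the average taken exhaustively over all sign patterns; the function names are hypothetical and the inputs are arbitrary illustrative numbers.

```python
from itertools import product


def average_over_signs(coeffs, values):
    """Mean of (sum_k s_k c_k h_k)^2 over all sign choices s_k = +1 or -1.

    coeffs[k] plays the role of (g, h_I) and values[k] of h_I(x) at a fixed x.
    """
    total = 0.0
    count = 0
    for signs in product((1, -1), repeat=len(coeffs)):
        s = sum(sg * c * h for sg, c, h in zip(signs, coeffs, values))
        total += s * s
        count += 1
    return total / count


def square_function(coeffs, values):
    """sum_k c_k^2 h_k^2, the value the sign average should reproduce."""
    return sum((c * h) ** 2 for c, h in zip(coeffs, values))
```

Integrating the resulting pointwise identity against $v\,dx$ gives the second display above.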



Choose $g = \chi_J w$. Then we come to the following necessary condition (notice that we keep only the summation over $I \subset J$):

(1.2)  $\forall J \in \mathcal{D}$,  $\frac{1}{|J|} \sum_{I \subset J} |(w)_{I_-} - (w)_{I_+}|^2 (v)_I |I| \le C (w)_J$.

Symmetrically (we can interchange $w$ and $v$),

(1.3)  $\forall J \in \mathcal{D}$,  $\frac{1}{|J|} \sum_{I \subset J} |(v)_{I_-} - (v)_{I_+}|^2 (w)_I |I| \le C (v)_J$.
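For weights that are constant on the cells of a dyadic grid, the smallest constant in a condition of the form (1.2) can be computed directly. The sketch below is only an illustration under that assumption (grid of $2^m$ cells on $[0,1)$, positive weights); `dyadic_intervals`, `avg` and `condition_ratio` are hypothetical helper names. Calling `condition_ratio(v, w)` with the arguments swapped gives the corresponding constant for (1.3).

```python
def dyadic_intervals(n):
    """Yield (start, size) for every dyadic block of at least 2 cells, n = 2**m."""
    size = n
    while size >= 2:
        for start in range(0, n, size):
            yield start, size
        size //= 2


def avg(values, start, size):
    """Average of a grid function over the block [start, start + size)."""
    return sum(values[start:start + size]) / size


def condition_ratio(w, v):
    """Sup over J of  (1/|J|) sum_{I in J} |(w)_{I-} - (w)_{I+}|^2 (v)_I |I| / (w)_J."""
    n = len(w)
    worst = 0.0
    for j_start, j_size in dyadic_intervals(n):
        total = 0.0
        for i_start, i_size in dyadic_intervals(n):
            if j_start <= i_start and i_start + i_size <= j_start + j_size:
                half = i_size // 2
                diff = avg(w, i_start, half) - avg(w, i_start + half, half)
                total += diff * diff * avg(v, i_start, i_size) * (i_size / n)
        lhs = total / (j_size / n)
        worst = max(worst, lhs / avg(w, j_start, j_size))
    return worst
```

A constant weight `w` gives ratio zero, since every difference of half-averages vanishes; blocks smaller than two cells contribute nothing because the weights are constant on cells.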

If $w^{-1} = u = v$, then, say, (1.2) becomes

(1.4)  $\frac{1}{|J|} \sum_{I \subset J} \big( (w)_{I_-} - (w)_{I_+} \big)^2 (w^{-1})_I |I| \le C (w)_J$,

which follows from the ($A_2$) condition (1.1) on $w$ (nontrivially, see e.g. [FKP] or [B]).

But in general neither (1.2) nor (1.3) follows from (1.1). This is shown in [N]. In what follows we will try to establish to what extent (1.1), (1.2), (1.3) is the full list of necessary and sufficient conditions.

In general this is not the case, but first we separate some cases when (1.1)-(1.3) or their simple modifications are sufficient for

(1.5)  $\sup_\varepsilon \|T_\varepsilon\|_{L^2(w^{-1}) \to L^2(v)} \le C < \infty$.
2. Reduction of Theorem 0.1 to Theorems 0.2, 0.3 and the proof of Theorem 0.2

Let us use the notation $(\cdot,\cdot)$, $(\cdot,\cdot)_{w^{-1}}$, and $(\cdot,\cdot)_v$ for the dualities in $L^2$, $L^2(w^{-1})$, and $L^2(v)$. Let $T$ be an arbitrary operator from $L^2(w^{-1})$ to $L^2(v)$ with the norm $\|T\|_{L^2(w^{-1}) \to L^2(v)} = \sup_{\|f\|_2 \le 1, \|g\|_2 \le 1} |(T w^{1/2} f, v^{1/2} g)|$, where $\|\cdot\|_2$ denotes the usual $L^2$-norm.

We are interested in estimating $\|T\|$ from above. Thus we can assume for the time being that $f$ and $g$ are bounded, as long as the final estimates of $\|T\|$ do not depend on these bounds.

Adopting this convention, let us write for arbitrary $f, g \in L^\infty$ the following decompositions of $f w^{1/2}$ and $g v^{1/2}$ with respect to the usual Haar basis:

$f w^{1/2} = \sum_{I} (f w^{1/2}, h_I) h_I$,

$g v^{1/2} = \sum_{I} (g v^{1/2}, h_I) h_I$.

The reader can notice that we are using the usual Haar system {hi} and its 
biorthogonal system {const/ — } in L 2 (v) to set up the decomposition in the output 
space L 2 (v) and we proceed similarly in the input space L 2 (w~ r ). 

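Now the expression $(T w^{1/2} f, v^{1/2} g)$ involved in the formula for the norm $\|T\|$ becomes

(2.1)  $(T w^{1/2} f, v^{1/2} g) = \sum_{I, J} (g, v^{1/2} h_I)(f, w^{1/2} h_J)(T h_J, h_I)$.

For the operators $T_\varepsilon$ we have

(2.2)  $(T_\varepsilon w^{1/2} f, v^{1/2} g) = \sum_{I \in \mathcal{D}} \varepsilon_I (g, v^{1/2} h_I)(f, w^{1/2} h_I)$.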

To estimate the latter expression we are going to use 

2.1. Disbalanced Haar functions. This is the system of functions $\{h_I^w\}_{I \in \mathcal{D}}$ having the following properties:

1) $h_I^w$ vanishes outside of $I$ and equals two different constants on the left and on the right halves of $I$,

2) $\int h_I^w w\,dx = 0$,

3) $\|h_I^w\|_{L^2(w)} = 1$.

Then this is an orthonormal system in $L^2(w)$. A system of this kind with nonpositive weight $z'(s)ds$ has been used in [CJS] to give a simple solution of a problem of Calderón. The following identity plays an important part below:



(2.3)  $\alpha_I \cdot h_I = h_I^w + A_I \cdot \chi_I$,

where the constants $\alpha_I$ and $A_I$ are uniquely defined by the properties of $\{h_I^w\}$ listed above. Let us compute them. Using 2) and 3) we get

$\alpha_I^2 (w)_I = \|h_I^w\|^2_{L^2(w)} + A_I^2 |I| (w)_I = 1 + A_I^2 |I| (w)_I$.

Considering the scalar product of both parts of (2.3) with the constant function in $L^2(w)$ we get $\alpha_I (h_I, 1)_w = A_I |I| (w)_I$. Thus (with $h_I$ positive on $I_-$)

$\alpha_I = \sqrt{\frac{(w)_I}{(w)_{I_-} (w)_{I_+}}}$,

$A_I = \alpha_I \frac{(w)_{I_-} - (w)_{I_+}}{2\sqrt{|I|}\,(w)_I}$,  that is,  $\frac{A_I}{\alpha_I} = \frac{(w)_{I_-} - (w)_{I_+}}{2\sqrt{|I|}\,(w)_I}$.
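The constants in (2.3) can be sanity-checked on a single interval. The sketch below assumes a weight equal to `w_left` on $I_-$ and `w_right` on $I_+$ (only the two half-averages matter for properties 2) and 3)) and the sign convention $h_I > 0$ on $I_-$; the closed forms used are $\alpha_I = \sqrt{(w)_I / ((w)_{I_-}(w)_{I_+})}$ and $A_I = \alpha_I \big((w)_{I_-} - (w)_{I_+}\big) / \big(2\sqrt{|I|}\,(w)_I\big)$, and the function names are hypothetical.

```python
import math


def disbalanced_constants(w_left, w_right, length):
    """Return (alpha_I, A_I) for a weight with half-averages w_left, w_right on I."""
    w_avg = (w_left + w_right) / 2.0
    alpha = math.sqrt(w_avg / (w_left * w_right))
    a_const = alpha * (w_left - w_right) / (2.0 * math.sqrt(length) * w_avg)
    return alpha, a_const


def disbalanced_values(w_left, w_right, length):
    """Values of h_I^w = alpha_I h_I - A_I chi_I on the left and right halves of I."""
    alpha, a_const = disbalanced_constants(w_left, w_right, length)
    amp = length ** -0.5               # |I|^(-1/2), the size of h_I
    return alpha * amp - a_const, -alpha * amp - a_const
```

With `left, right = disbalanced_values(1.0, 4.0, 0.5)`, the weighted integral `(0.5 / 2) * (left * 1.0 + right * 4.0)` vanishes and the weighted square integral `(0.5 / 2) * (left ** 2 * 1.0 + right ** 2 * 4.0)` equals one, up to rounding. The same numbers also illustrate the bound $1/\alpha_I \le \sqrt{(w)_I}$, which follows from $(w)_{I_-}(w)_{I_+} \le (w)_I^2$.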
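And having this in mind let us plug (2.3) into (2.2) to get four sums. Here $\alpha_I^w$, $A_I^w$ denote the constants from (2.3), and $\alpha_I^v$, $A_I^v$, $h_I^v$ denote the analogous objects built from the weight $v$:

$(T_\varepsilon w^{1/2} f, v^{1/2} g) = \sum_{I \in \mathcal{D}} \varepsilon_I \frac{1}{\alpha_I^v \alpha_I^w} (g v^{-1/2}, h_I^v)_v (f w^{-1/2}, h_I^w)_w$

$\quad + \sum_{I \in \mathcal{D}} \varepsilon_I \frac{1}{\alpha_I^v} (g v^{-1/2}, h_I^v)_v (f w^{1/2})_I \frac{A_I^w}{\alpha_I^w} |I|$

$\quad + \sum_{I \in \mathcal{D}} \varepsilon_I (g v^{1/2})_I (f w^{-1/2}, h_I^w)_w \frac{1}{\alpha_I^w} \frac{A_I^v}{\alpha_I^v} |I|$

$\quad + \sum_{I \in \mathcal{D}} \varepsilon_I (g v^{1/2})_I (f w^{1/2})_I \frac{A_I^v}{\alpha_I^v} \frac{A_I^w}{\alpha_I^w} |I|^2$

$\quad = \Sigma_1 + \Sigma_2 + \Sigma_3 + \Sigma_4$.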

To reduce Theorem 0.1 to Theorems 0.2, 0.3 we will prove that the assumptions of 
Theorem 0.1 imply the assumptions of Theorems 0.2, 0.3. We already saw that the 
averaging over signs of the second assumption of Theorem 0.1 implies (1.2) and the 
averaging over signs of the third assumption of Theorem 0.1 implies (1.3). So in our 
reduction we can already use (1.2), (1.3) freely. 

The four sums above define four linear operators whose bilinear forms are given by the corresponding sums. Let us call them $D_\varepsilon$, $\Pi_\varepsilon$, $\Pi'_\varepsilon$, $T_{0,\varepsilon}$ correspondingly.

A very important remark now is the following. Suppose that all assumptions of Theorem 0.1 are satisfied. Then the operators $v^{1/2} T_\varepsilon w^{1/2}$ are uniformly (in $\varepsilon$) bounded on $f$, $g$ if and only if $T_{0,\varepsilon}$ are uniformly bounded on $f$, $g$.

This is true because of the following analysis of the operators $D_\varepsilon$, $\Pi_\varepsilon$, $\Pi'_\varepsilon$, which says that these operators are uniformly (in $\varepsilon$) bounded if the assumptions of Theorem 0.1 are satisfied. To analyse these operators is the same as to analyse their bilinear forms given by $\Sigma_i$, $i = 1, 2, 3$.

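2.2. Estimates of $\Sigma_i$, $i = 1, 2, 3$. There is nothing to estimate in $\Sigma_1$. Notice that the constant $\alpha_I$ built from the weight $v$, which we denote $\alpha_I^v$, satisfies $\frac{1}{\alpha_I^v} \le \sqrt{(v)_I}$, so we can use the bound $\frac{1}{\alpha_I^v} \cdot \frac{1}{\alpha_I^w} \le \sqrt{(v)_I} \sqrt{(w)_I} \le C$ to write

(2.4)  $\Sigma_1 \le C \|g v^{-1/2}\|_{L^2(v)} \cdot \|f w^{-1/2}\|_{L^2(w)} = C \|g\|_2 \cdot \|f\|_2$.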



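To estimate $\Sigma_2$ let us notice that

$\frac{1}{\alpha_I^v} \frac{|A_I^w|}{\alpha_I^w} |I| \le \sqrt{(v)_I}\, \frac{|(w)_{I_-} - (w)_{I_+}|}{(w)_I} \sqrt{|I|}$.

Then

$\Sigma_2 \le \Big( \sum_{I \in \mathcal{D}} |(g v^{-1/2}, h_I^v)_v|^2 \Big)^{1/2} \Big( \sum_{I \in \mathcal{D}} (f w^{1/2})_I^2 (v)_I \Big( \frac{(w)_{I_-} - (w)_{I_+}}{(w)_I} \Big)^2 |I| \Big)^{1/2}$

$\quad \le \|g\|_2 \cdot \Big( \sum_{I \in \mathcal{D}} (f w^{1/2})_I^2 (v)_I \Big( \frac{(w)_{I_-} - (w)_{I_+}}{(w)_I} \Big)^2 |I| \Big)^{1/2}$.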

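To estimate the last expression let us use the following lemma.

Lemma 2.1. Let $\{a_I\}_{I \in \mathcal{D}}$ be a sequence of nonnegative numbers. Then

$\sum_{I \in \mathcal{D}} (f w^{1/2})_I^2 a_I \le C \|f\|_2^2$

if and only if for all $J$,

$\frac{1}{|J|} \sum_{I \subset J} (w)_I^2 a_I \le C (w)_J$.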



We postpone the proof till Section 6. If we use this lemma and (1.2) we can finish the estimate of $\Sigma_2$:

(2.5)  $\Sigma_2 \le C_2 \|f\|_2 \cdot \|g\|_2$.

Similarly, (1.3) gives

(2.6)  $\Sigma_3 \le C_3 \|f\|_2 \cdot \|g\|_2$.

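We conclude that if the assumptions of Theorem 0.1 hold then, for any given pair $f$, $g$ of $L^2$-functions, $\sup_\varepsilon |(v^{1/2} T_\varepsilon w^{1/2} f, g)| \le A \|f\|_2 \|g\|_2$ if and only if $\sup_\varepsilon |(T_{0,\varepsilon} f, g)| \le B \|f\|_2 \|g\|_2$. But this supremum can be computed as follows:

$\sup_\varepsilon |(T_{0,\varepsilon} f, g)| = \frac{1}{4} \sum_{I \in \mathcal{D}} |(g v^{1/2})_I| \, |(f w^{1/2})_I| \, \frac{|(v)_{I_-} - (v)_{I_+}|}{(v)_I} \frac{|(w)_{I_-} - (w)_{I_+}|}{(w)_I} |I|$.

Clearly the estimate

(2.7)  $\sum_{I \in \mathcal{D}} |(g v^{1/2})_I| \, |(f w^{1/2})_I| \, \frac{|(v)_{I_-} - (v)_{I_+}|}{(v)_I} \frac{|(w)_{I_-} - (w)_{I_+}|}{(w)_I} |I| \le C \|f\|_2 \|g\|_2$

is equivalent to

$|(T_0 f, g)| \le C \|f\|_2 \|g\|_2$.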



Now we can finish the reduction. The second and third assumptions of Theorem 0.1 say exactly that for the pairs $f = \chi_J w^{1/2}$, $g = \chi_J g$ and $f = \chi_J f$, $g = \chi_J v^{1/2}$ the supremum of bilinear forms $\sup_\varepsilon |(v^{1/2} T_\varepsilon w^{1/2} f, g)|$ has the desired estimate. The consideration written above then implies that on such pairs the bilinear form $|(T_0 f, g)|$ has the desired estimate. But this means exactly that the second and third assumptions of Theorem 0.3 are satisfied. The reduction is finished. We saw that the assumptions of Theorem 0.1 imply the first three assumptions of Theorem 0.2 and all assumptions of Theorem 0.3, but they, in their turn, imply the fourth assumption of Theorem 0.2. So we are left to prove Theorems 0.2 and 0.3.

Notice that Theorem 0.2 is already proved. In fact, we established that the estimate $\sup_\varepsilon |(T_\varepsilon f, g)| \le C \|f\|_2 \|g\|_2$ implies (1.1)-(1.3). We also know that (1.1)-(1.3) guarantee the equivalence of the inequality $\sup_\varepsilon |(T_\varepsilon f, g)| \le C \|f\|_2 \|g\|_2$ with the inequality $|(T_0 f, g)| \le C \|f\|_2 \|g\|_2$. So Theorem 0.2 is completely proved.



3. $A_\infty$ conditions and sufficient conditions

In this section we show that the relatively simple conditions (1.1)-(1.3) are already sufficient for the uniform boundedness of $\|T_\varepsilon\|_{L^2(w^{-1}) \to L^2(v)}$ if the weights have certain properties. In the first theorem below we assume that either $v$ or $w$ belongs to $A_\infty$.

Theorem 3.1. If the necessary conditions (1.2), (1.3) hold and one of the functions $v$ or $w$ is in $A_\infty$, then (2.7) holds too.

Proof. Let $v \in A_\infty$. Then (see [FKP]) we have



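(3.1)  $\forall J$,  $\frac{1}{|J|} \sum_{I \subset J} \Big( \frac{(v)_{I_-} - (v)_{I_+}}{(v)_I} \Big)^2 (v)_I |I| \le C (v)_J$.

Application of Lemma 2.1 now proves that

(3.2)  $\sum_{I \in \mathcal{D}} (g v^{1/2})_I^2 \Big( \frac{(v)_{I_-} - (v)_{I_+}}{(v)_I} \Big)^2 \frac{|I|}{(v)_I} \le C \|g\|_2^2$.

Then the left part of (2.7) can be estimated as

$\Big( \sum_{I \in \mathcal{D}} (g v^{1/2})_I^2 \Big( \frac{(v)_{I_-} - (v)_{I_+}}{(v)_I} \Big)^2 \frac{|I|}{(v)_I} \Big)^{1/2} \Big( \sum_{I \in \mathcal{D}} (f w^{1/2})_I^2 \Big( \frac{(w)_{I_-} - (w)_{I_+}}{(w)_I} \Big)^2 (v)_I |I| \Big)^{1/2}$.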




Now we apply Lemma 2.1 to the second factor. Notice that the condition of Lemma 2.1 is satisfied (it is just (1.2)), so we conclude that the second factor is at most $C \|f\|_2$, and the theorem is proved for the case $v \in A_\infty$. The case $w \in A_\infty$ is completely symmetric.



In the next theorem we require that

(3.3)  $w \in A_\infty(v)$.

We remind the reader that there are many equivalent wordings of this assertion, which can be found in [St]. For us here it will be important that this property means the following:

$\forall \varepsilon > 0 \ \exists \delta > 0$, $\forall I$, $\forall E \subset I$:  $\frac{v(E)}{v(I)} < \delta \ \Rightarrow \ \frac{w(E)}{w(I)} < \varepsilon$.

Recall also that this property is symmetric.



Theorem 3.2. If $w$ is in $A_\infty(v)$ and if $w$, $v$ have the doubling property, then the operator $T_0$ is bounded if and only if (1.1) holds. In particular, in this situation $\sup_\varepsilon \|T_\varepsilon\|_{L^2(w^{-1}) \to L^2(v)} < \infty$ if and only if (1.1)-(1.3) hold.



Boundedness of $T_0$ is equivalent to (2.7). We have to be able to prove (2.7) starting with our assumptions. An important particular case of (2.7) appears when one writes (2.7) for $g = \chi_J v^{1/2}$, $f = \chi_J w^{1/2}$, where $J$ is an arbitrary dyadic interval. So we have to be able to prove the following "Carleson measure type inequality":

$\frac{1}{|J|} \sum_{I \subset J} |(v)_{I_-} - (v)_{I_+}| \, |(w)_{I_-} - (w)_{I_+}| \, |I| \le C \sqrt{(v)_J (w)_J}$.

It turns out that the proof of this inequality plays an important role in the proof of Theorem 3.2.



Lemma 3.3. If (1.1) holds then the above inequality holds. Moreover, for any $a \in (0, 1]$, $a \ne \frac{1}{2}$, the following more general inequality holds
1^1 ^ (w>J



We postpone the proof of Lemma 3.3 till the next section. We will see there that the constant $C(a)$ blows up when $a$ approaches $\frac{1}{2}$. Now we use it to give the proof of Theorem 3.2.



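Proof of Theorem 3.2. Let $J$ be in $\mathcal{D}$ and let $\mathcal{J}$ be a disjoint family of its dyadic subintervals $I$ having the following property

$\frac{(F)_I (G)_I}{(w)_I (v)_I} \ge B \frac{(F)_J (G)_J}{(w)_J (v)_J}$.

Then $\frac{w(\cup_{I \in \mathcal{J}} I)}{w(J)} \le \varepsilon$ and $\frac{v(\cup_{I \in \mathcal{J}} I)}{v(J)} \le \varepsilon$, where $\varepsilon$ depends on $B$ and is small if $B$ is large. In fact, the intervals $I$ are either of the type that $\frac{(F)_I}{(w)_I} \ge \sqrt{B} \frac{(F)_J}{(w)_J}$ or of the type that $\frac{(G)_I}{(v)_I} \ge \sqrt{B} \frac{(G)_J}{(v)_J}$. Let $\Omega_1$ be the union of the first type intervals, and let $\Omega_2$ denote the union of the second type intervals which are not in $\Omega_1$. Then obviously

$w(\Omega_1)/w(J) \le \delta$,  $v(\Omega_2)/v(J) \le \delta$.

We are ready to use that $w \in A_\infty(v)$ (which also means $v \in A_\infty(w)$) to conclude that

(3.4)  $\frac{v(\Omega_1)}{v(J)} \le \varepsilon$,  $\frac{w(\Omega_2)}{w(J)} \le \varepsilon$.

To apply this remark let us fix $f$, $g$ from $L^2(\mathbb{T})$ and denote $F = f w^{1/2}$, $G = g v^{1/2}$. Let $(F)$, $(G)$ be the averages over $\mathbb{T}$. Fix a very large number $C$ to be chosen later and let $\mathcal{J}_k$ denote the family of maximal dyadic intervals $I$ such that

$\frac{(F)_I (G)_I}{(w)_I (v)_I} \ge C^k \frac{(F)(G)}{(w)(v)}$.

Also let $\mathcal{G}_k$ denote the collection of dyadic intervals which are contained in some $J$ from $\mathcal{J}_k$ and are not contained in any $J$ from $\mathcal{J}_{k+1}$.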
Now we use Lemma 3.3 and (1.1) to conclude the following 
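$\sum_{I \in \mathcal{G}_k, \, I \subset J} (F)_I (G)_I \frac{|(w)_{I_-} - (w)_{I_+}|}{(w)_I} \frac{|(v)_{I_-} - (v)_{I_+}|}{(v)_I} |I| \le C \sqrt{A} \, \frac{(F)_J (G)_J}{(w)_J (v)_J} \sqrt{w(J) v(J)}$,  $J \in \mathcal{J}_k$.

Here $A = \sup_{J \in \mathcal{D}} (w)_J (v)_J$.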



For an interval $J$ from $\mathcal{J}_k$ let us denote by $E_J$ the set $J \setminus \cup_{I \in \mathcal{J}_{k+1}} I$. We would like to replace $w(J)$, $v(J)$ in the last sum by $w(E_J)$, $v(E_J)$. Suppose for a moment that we can do that. Let us denote by $M_v$ the dyadic maximal function with respect to the measure $v\,dm$. Do similarly for $w$ to obtain $M_w$. Notice that $(G)_J/(v)_J \le M_v(g v^{-1/2})(x)$ for any $x \in J$. Similarly, $(F)_J/(w)_J \le M_w(f w^{-1/2})(x)$ for any $x \in J$. Notice also that all $E_J$ are disjoint, and we assume temporarily that, say, $w(J) \le 2 w(E_J)$ and $v(J) \le 2 v(E_J)$.

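Then we finish our estimate as follows:

$\sum_{I \in \mathcal{D}} (F)_I (G)_I \frac{|(w)_{I_-} - (w)_{I_+}|}{(w)_I} \frac{|(v)_{I_-} - (v)_{I_+}|}{(v)_I} |I| \le C \sqrt{A} \sum_k \sum_{J \in \mathcal{J}_k} \frac{(F)_J}{(w)_J} \frac{(G)_J}{(v)_J} \sqrt{w(J) v(J)}$

$\quad \le 2 C \sqrt{A} \sum_k \sum_{J \in \mathcal{J}_k} \frac{(F)_J}{(w)_J} \frac{(G)_J}{(v)_J} \sqrt{w(E_J) v(E_J)}$

$\quad \le 2 C \sqrt{A} \Big( \int (M_w(f w^{-1/2})(x))^2 w(x)\,dx \Big)^{1/2} \Big( \int (M_v(g v^{-1/2})(x))^2 v(x)\,dx \Big)^{1/2}$

$\quad \le 2 C \sqrt{A} \, \|M_w(f w^{-1/2})\|_{L^2(w)} \|M_v(g v^{-1/2})\|_{L^2(v)} \le C' \|f\|_2 \|g\|_2$.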

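We are left to choose $C$ so large that $w(J) \le 2 w(E_J)$ and $v(J) \le 2 v(E_J)$. To do that let us notice that the maximality of $J$ in $\mathcal{J}_k$ and the doubling properties imply that

$C^k \frac{(F)(G)}{(w)(v)} \le \frac{(F)_J (G)_J}{(w)_J (v)_J} \le D C^k \frac{(F)(G)}{(w)(v)}$.

So if $I \subset J$ belongs to $\mathcal{J}_{k+1}$ we have that

$\frac{(F)_I (G)_I}{(w)_I (v)_I} \ge \frac{C}{D} \frac{(F)_J (G)_J}{(w)_J (v)_J}$.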
The remark at the beginning of the proof of the theorem shows that if $C$ is much larger than $D$ then the $v$-measure of $J \cap (\cup_{I \in \mathcal{J}_{k+1}} I)$ is smaller than any given fraction $\varepsilon$ (say, $\frac{1}{2}$) of $v(J)$. And the same is true about the $w$-measure of this set.

Thus $w(J) \le 2 w(E_J)$ and $v(J) \le 2 v(E_J)$, and the theorem is completely proved.

4. Bellman function and Carleson measures

Here we are going to prove Lemma 3.3. We also apply the lemma to give another 
sufficient condition for uniform boundedness of our T £ . 

In the proof of Lemma 3.3 the Bellman functions approach appears for the first 
time. It will play the key role in the rest of the paper. 

Proof of Lemma 3.3. Without loss of generality we assume that

(4.1)  $(v)_I (w)_I \le 1$.

Let us first prove the case $a \in (0, 1/2)$. Let us introduce the function

$\Phi(x, y) = \sup \frac{1}{|J|} \sum_{I \subset J} ((v)_I (w)_I)^a \frac{|(v)_{I_-} - (v)_{I_+}|}{(v)_I} \frac{|(w)_{I_-} - (w)_{I_+}|}{(w)_I} |I|$,

where the supremum is taken over all nonnegative finite linear combinations $v$, $w$ of characteristic functions of dyadic intervals such that $(v)_J = x$, $(w)_J = y$. Notice that $\Phi$ does not depend on $J$. Notice also that by the definition $\Phi$ is concave. Moreover, if $x = \frac{x_+ + x_-}{2}$, $y = \frac{y_+ + y_-}{2}$ then

(4.2)  $\Phi(x, y) - \frac{1}{2} \big( \Phi(x_+, y_+) + \Phi(x_-, y_-) \big) \ge (xy)^a \frac{|x_+ - x_-|}{x} \frac{|y_+ - y_-|}{y}$.

If only one could prove that $\Phi(x, y) \le C(a) (xy)^a$.... Let us reverse the argument. Suppose we can construct a concave $B$ in the domain $D = \{x > 0, y > 0, xy \le 1\}$ such that

(4.3)  $0 \le B \le (xy)^a$,  $(x, y) \in D$,

and such that

(4.4)  $B(x, y) - \frac{1}{2} \big( B(x_+, y_+) + B(x_-, y_-) \big) \ge c_a (xy)^a \frac{|x_+ - x_-|}{x} \frac{|y_+ - y_-|}{y}$

if $(x, y), (x_+, y_+), (x_-, y_-) \in D$ and if $x = \frac{x_+ + x_-}{2}$, $y = \frac{y_+ + y_-}{2}$. Then we are done.

And for $a \in (0, 1/2)$ it is easy to guess such a function:

$B(x, y) = (xy)^a$.

Let us consider it in the larger domain $\{x > 0, y > 0\}$ and prove (4.4) there. Consider the function

$b(t) = B(x + t\xi, y + t\eta)$,  $t \in [-1, 1]$,

where $\xi = \frac{x_+ - x_-}{2}$, $\eta = \frac{y_+ - y_-}{2}$. Clearly $b$ is concave, because $B$ is concave in $\{x > 0, y > 0\}$. We want to estimate $b''(t)$ and to prove that

(4.5)  $b''(t) \le -c_a \frac{|\xi| |\eta|}{x^{1-a} y^{1-a}}$,  $t \in [-1/2, 1/2]$.

If (4.5) were proved, then we could write

$b(0) - \frac{1}{2} (b(-1) + b(1)) = -\frac{1}{2} \int_{-1}^{1} (1 - |t|) b''(t)\,dt \ge c \frac{|\xi| |\eta|}{x^{1-a} y^{1-a}}$,

which is exactly (4.4).

Let $H_{x,y}(\xi, \eta)$ be the Hessian form of $B$ at the point $(x, y)$ on the vector $(\xi, \eta)$. Denote $x_t = x + t\xi$, $y_t = y + t\eta$. Then

$b''(t) = H_{x_t, y_t}(\xi, \eta)$.

Here is the computation of $H_{x_t, y_t}(\xi, \eta)$:

$-H_{x_t, y_t}(\xi, \eta) = a x_t^a y_t^a \Big[ (1 - a) \Big( \frac{\xi}{x_t} \Big)^2 + (1 - a) \Big( \frac{\eta}{y_t} \Big)^2 - 2a \frac{\xi \eta}{x_t y_t} \Big] \ge a (1 - 2a) x_t^a y_t^a \Big[ \Big( \frac{\xi}{x_t} \Big)^2 + \Big( \frac{\eta}{y_t} \Big)^2 \Big]$,

so that

$H_{x_t, y_t}(\xi, \eta) \le -(1 - 2a) a \frac{|\xi| |\eta|}{x_t^{1-a} y_t^{1-a}}$.

Noticing that for $t \in [-1/2, 1/2]$ we have $x/2 \le x_t \le 3x/2$, $y/2 \le y_t \le 3y/2$, we get (4.5), which gives the proof of Lemma 3.3 for the case $a < 1/2$.
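The concavity estimate (4.4) for $B(x,y) = (xy)^a$ can be probed numerically. The sketch below is only an illustration: the constant $c_a = a(1-2a)4^{a-2}$ is one admissible value obtained by integrating the Hessian bound over $[-1,1]$ with the crude estimate $x_t \le 2x$, $y_t \le 2y$; it is an assumption of this example and is not claimed to be the constant intended in the paper. The function names are hypothetical.

```python
def bellman(x, y, a):
    """Candidate Bellman function B(x, y) = (xy)^a."""
    return (x * y) ** a


def concavity_gain(x_minus, x_plus, y_minus, y_plus, a):
    """B(x, y) - (B(x+, y+) + B(x-, y-)) / 2 with x, y the midpoints."""
    x = (x_minus + x_plus) / 2.0
    y = (y_minus + y_plus) / 2.0
    return bellman(x, y, a) - 0.5 * (bellman(x_plus, y_plus, a)
                                     + bellman(x_minus, y_minus, a))


def lower_bound(x_minus, x_plus, y_minus, y_plus, a):
    """c_a (xy)^a |x+ - x-|/x |y+ - y-|/y with the illustrative constant c_a."""
    x = (x_minus + x_plus) / 2.0
    y = (y_minus + y_plus) / 2.0
    c_a = a * (1.0 - 2.0 * a) * 4.0 ** (a - 2.0)   # assumed admissible value
    return c_a * (x * y) ** a * abs(x_plus - x_minus) / x * abs(y_plus - y_minus) / y
```

For random positive endpoints and $0 < a < 1/2$ the gain dominates the bound, and the constant degenerates as $a \to 1/2$, in line with the blow-up of $C(a)$ mentioned after Lemma 3.3.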

Our key inequality (4.5) could be proved differently. Denote $t = xy$, $x_\pm = (1 \pm \lambda) x$, $y_\pm = (1 \pm \mu) y$, $\lambda, \mu \in [-1, 1]$. We come to proving that for $\lambda, \mu \in [-1, 1]$ the following holds:

(4.6)  $1 - \frac{1}{2} \big\{ [(1 - \lambda)(1 - \mu)]^a + [(1 + \lambda)(1 + \mu)]^a \big\} \ge c_a |\lambda \mu|$.

This elementary inequality is true when $a \in (0, 1/2)$, which can be checked by direct computations.

Now let us consider the case $a \in (1/2, 1]$. Again we should notice that it is enough to build a function $B$ such that

(4.7)  $0 \le B(x, y) \le C (xy)^{1/2}$,  $(x, y) \in D$,

and

(4.8)  $B(x, y) - \frac{1}{2} \big( B(x_+, y_+) + B(x_-, y_-) \big) \ge c_a (xy)^a \frac{|x_+ - x_-|}{x} \frac{|y_+ - y_-|}{y}$,

where both inequalities hold in $D = \{(x, y) : x > 0, y > 0, xy \le 1\}$ and $x = \frac{x_+ + x_-}{2}$, $y = \frac{y_+ + y_-}{2}$.

Unfortunately, $B = (xy)^{1/2}$ does not work. It is not concave enough near the diagonal $x = y$. Here is the function which satisfies (4.7), (4.8):

$B(x, y) = (xy)^{1/2} - \frac{1}{4} (xy)^a$.

Again there is nothing to prove in (4.7). It is a nonnegative function because $xy \le 1$ in $D$.

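Consider the function

$b(t) = B(x + t\xi, y + t\eta)$,

where $\xi = \frac{x_+ - x_-}{2}$, $\eta = \frac{y_+ - y_-}{2}$. Clearly $b$ is concave, because $B$ is concave in $\{x > 0, y > 0\}$. We want to estimate $b''(t)$ and to prove that

(4.9)  $b''(t) \le -c_a \frac{|\xi| |\eta|}{x^{1-a} y^{1-a}}$,  $t \in [-1/2, 1/2]$.

If (4.9) were proved, then we could write

$b(0) - \frac{1}{2} (b(-1) + b(1)) = -\frac{1}{2} \int_{-1}^{1} (1 - |t|) b''(t)\,dt \ge C(a) \frac{|\xi| |\eta|}{x^{1-a} y^{1-a}}$,

which is exactly (4.8).

Let $H_{x,y}(\xi, \eta)$ be the Hessian form of $B$ at the point $(x, y)$ on the vector $(\xi, \eta)$. Denote $x_t = x + t\xi$, $y_t = y + t\eta$. Then

$b''(t) = H_{x_t, y_t}(\xi, \eta)$.

Here is the computation of $H_{x,y}(\xi, \eta)$:

$H_{x,y}(\xi, \eta) = \frac{1}{4} (xy)^{1/2} \Big[ 2 \big( 1 - a^2 (xy)^{a - 1/2} \big) \frac{\xi \eta}{xy} - \big( 1 - a(1 - a)(xy)^{a - 1/2} \big) \Big( \frac{\xi}{x} \Big)^2 - \big( 1 - a(1 - a)(xy)^{a - 1/2} \big) \Big( \frac{\eta}{y} \Big)^2 \Big]$.

Let us denote by $p$ the expression $a^2 (xy)^{a - 1/2}$ and by $r$ the expression $a(1 - a)(xy)^{a - 1/2}$. Then

$H_{x,y}(\xi, \eta) = \frac{1}{4} (xy)^{1/2} \Big\{ -(1 - p) \Big[ \frac{\xi}{x} - \frac{\eta}{y} \Big]^2 - (p - r) \Big[ \Big( \frac{\xi}{x} \Big)^2 + \Big( \frac{\eta}{y} \Big)^2 \Big] \Big\} \le -\frac{1}{4} (p - r) (xy)^{1/2} \Big[ \Big( \frac{\xi}{x} \Big)^2 + \Big( \frac{\eta}{y} \Big)^2 \Big]$.

Let us look at $p - r = (a^2 - a(1 - a))(xy)^{a - 1/2} = a(2a - 1)(xy)^{a - 1/2}$. It is positive when $a > 1/2$. In particular,

(4.10)  $H_{x_t, y_t}(\xi, \eta) \le -c_a \frac{|\xi| |\eta|}{x^{1-a} y^{1-a}}$

if $t \in [-1/2, 1/2]$, because $x/2 \le x_t \le 3x/2$, $y/2 \le y_t \le 3y/2$ for such $t$. Now (4.10) implies (4.9) immediately. The proof of Lemma 3.3 is completed.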



Let us show some corollaries of Lemma 3.3. Fix a number $q \in (0, 1)$. We denote by $\mathcal{F}_k$, $k = 0, 1, 2, \ldots$ the family of all dyadic intervals $I$ such that

$q^{k+1} < (v)_I (w)_I \le q^k$.

Consider the following measure in the disk

$\sigma_k = \sum_{I \in \mathcal{F}_k} \frac{|(v)_{I_-} - (v)_{I_+}|}{(v)_I} \frac{|(w)_{I_-} - (w)_{I_+}|}{(w)_I} |I| \, \delta_{c(I)}$,

where $c(I)$ denotes the center of the box $Q(I)$ built over the dyadic arc $I$.

Let us recall that a measure $\sigma$ in $\mathbb{D}$ is called Carleson if $\sigma(Q(I)) \le C |I|$ for all arcs $I$. The best constant in this inequality is called the Carleson norm of $\sigma$ and is denoted by $\|\sigma\|_C$.

Lemma 4.1. The measures $\sigma_k$ are Carleson measures and $\|\sigma_k\|_C \le B < \infty$.

The proof follows immediately from Lemma 3.3 with $a \in (0, 1/2)$. Let us notice that in the case when $(v)_I (w)_I$ is bounded also from below (so in the case when we can assume $w^{-1} = u = v$, and we consider a one-weight problem) we get that $\sum_{k=0}^{\infty} \sigma_k$ is a Carleson measure.
We will see now how a slightly strengthened version of (1.1) becomes sufficient for the boundedness of $\|T_0\|_{L^2(w^{-1}) \to L^2(v)}$ (but it stops being necessary, of course). We use the Carleson measure result that has just been proved.

Theorem 4.2. Suppose that

(4.11)  $\forall I \in \mathcal{D}$,  $(v^{1+\eta})_I (w^{1+\eta})_I \le A < \infty$.

Then $\|T_0\|_{L^2(w^{-1}) \to L^2(v)} \le C(A) < \infty$.



Remark. There is a whole stream of results of this kind starting with Fefferman- 
Phong theorem from [F] and continuing in [ChWW], [Z]. 

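Proof. Let us recall that the boundedness of $T_0$ is equivalent to the following estimate for all $L^2$-functions $f$, $g$:

(4.12)  $\sum_{I \in \mathcal{D}} (g v^{1/2})_I (f w^{1/2})_I \alpha_I \le C \|g\|_2 \|f\|_2$,

where

$\alpha_I = \frac{|(v)_{I_-} - (v)_{I_+}|}{(v)_I} \frac{|(w)_{I_-} - (w)_{I_+}|}{(w)_I} |I|$.

Clearly for $\gamma + \rho = 1/2$, $\gamma, \rho > 0$, we have

$(g v^{1/2})_I \le (g^p)_I^{1/p} (v^{\gamma q_1})_I^{1/q_1} (v^{\rho q_2})_I^{1/q_2}$,

where $1/p + 1/q_1 + 1/q_2 = 1$. Choose $\gamma = \rho = 1/4$, $q_1 = 4$, $q_2 = 4(1 + \eta)$ to have $p = \frac{4(1+\eta)}{2 + 3\eta} < 2$. Then

$(g v^{1/2})_I \le (g^p)_I^{1/p} (v^{1+\eta})_I^{1/q_2} (v)_I^{1/4}$,

$(f w^{1/2})_I \le (f^p)_I^{1/p} (w^{1+\eta})_I^{1/q_2} (w)_I^{1/4}$.

Recall that the measures from Lemma 4.1,

$\sigma_k = \sum_{I \in \mathcal{F}_k} \alpha_I \delta_{c(I)}$,

are proved to be uniformly Carleson. Thus the measure

$\sigma = \sum_k \sum_{I \in \mathcal{F}_k} ((v)_I (w)_I)^{1/4} \alpha_I \delta_{c(I)}$

is a Carleson measure. Let us notice that $2/p > 1$, and let us use the Carleson embedding theorem in $L^{2/p}$ (see [G]) to estimate the sum in (4.12).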
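$\sum_{I \in \mathcal{D}} (g v^{1/2})_I (f w^{1/2})_I \alpha_I \le \sup_I \big( (v^{1+\eta})_I (w^{1+\eta})_I \big)^{1/q_2} \cdot \sum_{I \in \mathcal{D}} (g^p)_I^{1/p} (f^p)_I^{1/p} ((v)_I (w)_I)^{1/4} \alpha_I$

$\quad \le A^{1/q_2} \Big( \sum_{I \in \mathcal{D}} (g^p)_I^{2/p} ((v)_I (w)_I)^{1/4} \alpha_I \Big)^{1/2} \Big( \sum_{I \in \mathcal{D}} (f^p)_I^{2/p} ((v)_I (w)_I)^{1/4} \alpha_I \Big)^{1/2}$

$\quad \le c(p) A^{1/q_2} \|\sigma\|_C \, \|g^p\|_{2/p}^{1/p} \|f^p\|_{2/p}^{1/p} \le C \|f\|_2 \|g\|_2$.

We are done.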

5. The necessary and sufficient conditions: bilinear weighted imbedding theorem

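We saw in Section 2 that, given the necessary conditions (1.1), (1.2), (1.3), the uniform
boundedness of $\|T_\varepsilon\|_{L^2(w^{-1})\to L^2(v)}$ is equivalent to the following estimate for all
$L^2$-functions $f$, $g$:

(5.1)  $\sum_{I\in\mathcal{D}} \langle g v^{1/2}\rangle_I \, \langle f w^{1/2}\rangle_I \, \alpha_I \le C\,\|g\|_2\,\|f\|_2,$

where

$\alpha_I = \frac{|\langle v\rangle_{I_-} - \langle v\rangle_{I_+}|}{\langle v\rangle_I} \cdot \frac{|\langle w\rangle_{I_-} - \langle w\rangle_{I_+}|}{\langle w\rangle_I} \cdot |I|.$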

We may (and should) consider (5.1) as a 



Consider the kernel function

$k(x,y) = \sum_{I\in\mathcal{D}} \frac{1}{|I|^2}\,\chi_I(x)\,\chi_I(y)\,\alpha_I.$

It is obvious that (5.1) means the estimate for any nonnegative measurable $G$:

(5.2)  $\int \left(\int k(x,y)\,G(y)\,w(y)\,dy\right)^2 v(x)\,dx \le C \int G^2\,w\,dy.$
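The passage from the sum in (5.1) to an integral operator can be checked on a finite dyadic grid. The sketch below assumes the kernel normalization $k(x,y) = \sum_I |I|^{-2}\chi_I(x)\chi_I(y)\alpha_I$ (the one for which the bilinear form of $k$ reproduces the sum $\sum_I \langle G\rangle_I\langle F\rangle_I\alpha_I$), uses functions that are constant on the finest cells, and truncates the dyadic lattice at a finite depth. All names are illustrative.

```python
import random

def dyadic_intervals(n):
    """Yield (start, length) in grid cells for all dyadic intervals of depth <= n."""
    N = 2 ** n
    for level in range(n + 1):
        length = N >> level
        for start in range(0, N, length):
            yield start, length

def check_kernel_form(n=4, seed=0):
    """Compare sum_I <F>_I <G>_I alpha_I with the double integral of k(x,y) G(x) F(y)."""
    rng = random.Random(seed)
    N = 2 ** n
    h = 1.0 / N                              # width of one grid cell
    F = [rng.random() for _ in range(N)]     # plays the role of f w^{1/2}
    G = [rng.random() for _ in range(N)]     # plays the role of g v^{1/2}
    k = [[0.0] * N for _ in range(N)]
    total = 0.0
    for start, length in dyadic_intervals(n):
        alpha = rng.random()                 # arbitrary nonnegative coefficient
        size = length * h                    # |I|
        cells = range(start, start + length)
        avg_F = sum(F[j] for j in cells) / length
        avg_G = sum(G[i] for i in cells) / length
        total += avg_F * avg_G * alpha
        for i in cells:
            for j in cells:
                k[i][j] += alpha / size ** 2
    integral = sum(k[i][j] * G[i] * F[j] for i in range(N) for j in range(N)) * h * h
    return total, integral

print(check_kernel_form())
```

The two returned numbers agree up to rounding, since each interval $I$ contributes $|I|^{-2}\alpha_I\int_I G\int_I F = \langle G\rangle_I\langle F\rangle_I\alpha_I$.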

The inequalities (5.2) have been extensively studied in the works of Kalton, Sawyer,
Verbitsky, and Wheeden. The next result is in the vein of those works. Inequalities
(5.2) can most probably be proved using a combination of ideas from the papers of
Kalton, Verbitsky [KV] and Sawyer, Wheeden [SW]. Following [KV] we can introduce
the new metric $d(x,y) := \frac{1}{k(x,y)}$, where $k$ is the kernel of $T$ written above. It
is clearly a metric, and balls in this metric are just all dyadic intervals. Then the kernel $k$
satisfies the regularity condition from [SW]. Using the main result of [SW] we could
have given an alternative proof of inequalities (5.2). Unfortunately it is not clear
why the new metric space has certain regularity properties. For example, in [KV]
one requires the property that all annuli be nonempty. This is false in our new metric
space. However, it is not clear to us how essential those regularity properties of
the metric space are for the application of the technique of [SW].
So we will give another proof.

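Plug into (5.2) the characteristic functions of the intervals. Then we get a condition
necessary for (5.2). Do the same with the dual inequality. Then we get another
necessary condition. These are the conditions of the Sawyer type:

(5.3)  $\forall J \in \mathcal{D}, \quad \frac{1}{|J|}\int_J \left(\int_J k(x,y)\,w(y)\,dy\right)^2 v(x)\,dx \le C\,\langle w\rangle_J,$

(5.4)  $\forall J \in \mathcal{D}, \quad \frac{1}{|J|}\int_J \left(\int_J k(x,y)\,v(x)\,dx\right)^2 w(y)\,dy \le C\,\langle v\rangle_J.$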
In other words, for all dyadic arcs J 




Let us denote by T the operator with kernel k. 



Theorem 5.1. Let $\{\alpha_I\}_{I\in\mathcal{D}}$ be a nonnegative sequence. The following assertions are
equivalent:

1) The operator $T$ is bounded from $L^2(w^{-1})$ to $L^2(v)$;

2) Estimate (5.1) holds;

3) Estimate (5.2) holds;

4) For any $J \in \mathcal{D}$, (5.5) and (5.6) hold with the same constant $C$ independent of
$J$.

Remark: Clearly (5.5) (or (5.6) as well) implies

(5.7)  $\frac{1}{|J|}\sum_{I\subset J} \langle v\rangle_I \langle w\rangle_I \alpha_I \le C\sqrt{\langle v\rangle_J \langle w\rangle_J},$

which is the conclusion of Lemma 3.3 for the case $\alpha = 1$, and so (5.7) follows from
just (1.1). Unfortunately, (5.5), (5.6) themselves do not follow from (1.1).

On the other hand (5.7) is a particular case of our key estimate (5.1). In fact, (5.1)
becomes (5.7) if $g = \chi_J v^{1/2}$, $f = \chi_J w^{1/2}$. The conclusion is that the particular case
of (5.1) for the test functions $\chi_J v^{1/2}$, $\chi_J w^{1/2}$ follows from (1.1). But this is not the case
for the full estimate (5.1). However, (5.5) and (5.6) are in fact the conditions obtained by
testing (5.1) by certain test functions. We are going to repeat what has already been
said above. Namely, it is easy to see that (5.5) is the testing of (5.1) by $f = \chi_J w^{1/2}$
and $g = g^J$, where $g^J$ is any $L^2$-function supported on $J$. Similarly (5.6) is the testing
of (5.1) by $g = \chi_J v^{1/2}$ and $f = f^J$, where $f^J$ is any $L^2$-function supported on $J$.

Let us conclude that in the particular case $v = w$ the bilinear weighted imbedding
estimate (5.1) becomes a usual weighted $L^2$-imbedding estimate for a linear operator:

(5.8)  $\sum_{I\in\mathcal{D}} \langle f w^{1/2}\rangle_I^2 \, \alpha_I \le C\,\|f\|_2^2.$

The test conditions (5.5), (5.6) become

(5.9)  $\forall J \in \mathcal{D}, \quad \frac{1}{|J|}\sum_{I\subset J} \langle w\rangle_I^2 \, \alpha_I \le C\,\langle w\rangle_J.$

Let us start our proof of Theorem 5.1 with a special proof of the classical equivalence
(5.8) $\Leftrightarrow$ (5.9) (see [S1], [S2], [TV]).

6. The necessary and sufficient conditions: the proof of Theorem 5.1 for the case w = v

Theorem 6.1. (5.8) $\Leftrightarrow$ (5.9).

Proof. One needs to prove only the implication (5.9) $\Rightarrow$ (5.8). Let us consider
the function

$\Phi_J(X, x, w) = \sup \frac{1}{|J|}\sum_{I\subset J} \langle f w^{1/2}\rangle_I^2 \, \alpha_I,$

where the supremum is taken over $f \ge 0$, $\langle f w^{1/2}\rangle_J = x$, $\langle f^2\rangle_J = X$, $\langle w\rangle_J = w$. Clearly,
this restricts us to

$D = \{(X, x, w) : x^2 \le Xw, \ X, x, w \ge 0\}.$

The family of functions $\Phi_J$ is concave in the following sense. Let $X = \frac{X_- + X_+}{2}$, $x = \frac{x_- + x_+}{2}$,
$w = \frac{w_- + w_+}{2}$. Then

(6.1)  $\Phi_J(X, x, w) - \frac{1}{2}\left(\Phi_{J_-}(X_-, x_-, w_-) + \Phi_{J_+}(X_+, x_+, w_+)\right) \ge \frac{1}{|J|}\,x^2\,\alpha_J.$

In fact, fixing the averages on $J_-$, $J_+$ separately leads to a smaller set of functions
than fixing the averages only on $J$ (we keep the compatibility by saying that the
averages over $J$ are arithmetic means of the averages over $J_-$, $J_+$).

One wishes to prove that $\Phi_J \le CX$. This certainly would give the result. But this
is certainly false because we did not use the condition (5.9) in the definition of $\Phi_J$.

Let us try to correct this by introducing

$\Psi_J(X, x, w) = \sup \frac{1}{|J|}\sum_{I\subset J} \langle f w^{1/2}\rangle_I^2 \, \alpha_I,$

where the supremum is taken over the same set of conditions, including $\langle w\rangle_J = w$, plus
the condition that $\{\langle w\rangle_I\}_{I\subset J}$ are so distributed that

(6.2)  $\frac{1}{|J'|}\sum_{I\subset J'} \langle w\rangle_I^2 \, \alpha_I \le C\,\langle w\rangle_{J'}, \quad \forall J' \subset J.$

Unfortunately, it is now not clear why (6.1) would hold with $\Psi_J$ replacing $\Phi_J$.
Conditioning as in (6.2) makes it unclear why $\{\Psi_J\}$ has even a simple concavity property.
In fact, by passing to $J_-$, $J_+$ from $J$ we did not diminish the set over which the
supremum is taken. Unlike the case of $\Phi$ above we actually relaxed our requirements.
Notice that inequality (6.2) for $J' = J$ will not be required anymore after passing to
$J_-$, $J_+$.

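But one can overcome this difficulty by fixing the "main" sum in (6.2) and making
its value $M$ the new variable. To do that let us consider

$B_J(X, x, w, M) \stackrel{\mathrm{def}}{=} \sup \frac{1}{|J|}\sum_{I\subset J} \langle f w^{1/2}\rangle_I^2 \, \alpha_I,$

where the supremum is taken over such functions that $f \ge 0$, $\langle f w^{1/2}\rangle_J = x$, $\langle f^2\rangle_J = X$,
$\langle w\rangle_J = w$, and over sequences $\{\alpha_I\}$ and functions $w$ such that

(6.3)  $\frac{1}{|J'|}\sum_{I\subset J'} \langle w\rangle_I^2 \, \alpha_I \le C\,\langle w\rangle_{J'}, \quad \forall J' \subset J,$

and such that

(6.4)  $M = \frac{1}{|J|}\sum_{I\subset J} \langle w\rangle_I^2 \, \alpha_I.$

Notice that $B_J$ thus defined does not depend on $J$ (so we write simply $B$), and that it is clearly defined and
concave in the domain

$D_B = \{(X, x, w, M) : x^2 \le Xw, \ (X, x, w, M) \ge 0, \ M \le Cw\}.$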
On top of concavity we know the following. Let $h = M - \frac{M_- + M_+}{2}$. Clearly,

$B(X,x,w,M) - \frac{B(X,x,w,M_-) + B(X,x,w,M_+)}{2} \ge \frac{x^2}{w^2}\,h.$

Thus we conclude that $B$ satisfies the following properties:

(6.5)  $B(X, x, w, M) - \frac{B(X_-,x_-,w_-,M_-) + B(X_+,x_+,w_+,M_+)}{2} \ge \frac{x^2}{w^2}\,h,$

(6.6)  $0 \le B \le CX$ on $D_B$,

(6.7)  $\frac{\partial B}{\partial M} \ge \frac{x^2}{w^2}$ on $D_B$,

(6.8)  $d^2 B \le 0$ on $D_B$.

Instead of looking for the exact formula for $B$ let us consider

$B(X, x, w, M) = CX - C\,\frac{x^2}{w + M},$

which satisfies all the properties (6.6)-(6.8). To finish the proof of (5.8) it is enough
to consider triples $x = \langle f w^{1/2}\rangle_J$, $X = \langle f^2\rangle_J$, $w = \langle w\rangle_J$, $x_\pm = \langle f w^{1/2}\rangle_{J_\pm}$, $X_\pm = \langle f^2\rangle_{J_\pm}$,
$w_\pm = \langle w\rangle_{J_\pm}$ and $M$, $M_\pm$ such that $M - \frac{M_- + M_+}{2} = \frac{1}{|J|}\,w^2\,\alpha_J$, and to prove that

$B(X,x,w,M) - \frac{B(X_-,x_-,w_-,M_-) + B(X_+,x_+,w_+,M_+)}{2} \ge \frac{1}{|J|}\,x^2\,\alpha_J.$

But the last inequality follows immediately from (6.7) and the concavity of $B$.
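The size, derivative and concavity properties of an explicit candidate can be spot-checked numerically. The sketch below assumes the candidate has the form $B = CX - Cx^2/(w+M)$ and that the domain constraint reads $M \le C_0 w$; the constant $C = (1+C_0)^2$, the sampling ranges and all function names are illustrative choices, not taken from the paper. A random test of this kind is a sanity check, not a proof.

```python
import random

def bellman_candidate(X, x, w, M, C):
    """Candidate B(X, x, w, M) = C*X - C*x**2/(w + M)."""
    return C * X - C * x * x / (w + M)

def random_point(rng, C0):
    """Random point with x^2 <= X*w and 0 <= M <= C0*w."""
    w = rng.uniform(0.1, 5.0)
    X = rng.uniform(0.1, 5.0)
    x = rng.uniform(0.0, 1.0) * (X * w) ** 0.5
    M = rng.uniform(0.0, C0 * w)
    return X, x, w, M

def check_candidate(C0=3.0, trials=2000, seed=1):
    """Spot-check 0 <= B <= C*X, dB/dM >= x^2/w^2 and midpoint concavity."""
    rng = random.Random(seed)
    C = (1.0 + C0) ** 2          # large enough for the derivative bound
    eps = 1e-9
    for _ in range(trials):
        p = random_point(rng, C0)
        q = random_point(rng, C0)
        X, x, w, M = p
        B = bellman_candidate(X, x, w, M, C)
        if not (-eps <= B <= C * X + eps):
            return False
        # dB/dM = C*x^2/(w+M)^2, to be compared with x^2/w^2
        if C * x * x / (w + M) ** 2 < x * x / (w * w) - eps:
            return False
        mid = tuple((s + t) / 2 for s, t in zip(p, q))
        if bellman_candidate(*mid, C) < (B + bellman_candidate(*q, C)) / 2 - eps:
            return False
    return True

print(check_candidate())
```

The derivative bound holds because $M \le C_0 w$ gives $w + M \le (1+C_0)w$, and concavity holds because $x^2/s$ is jointly convex in $(x, s)$ for $s > 0$ and $s = w + M$ is linear.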

Remark. It is interesting to notice that one can give another heuristic explanation
of the appearance of the new variable $M$. Let us think that all averages are variables and all
sums are functions defined on the set of all averages involved. Then $M$ is a pretty
concave function of $w$ in the following sense:

(6.9)  $M(w_J) - \frac{1}{2}\left[M(w_{J_+}) + M(w_{J_-})\right] \ge \frac{1}{|J|}\,w_J^2\,\alpha_J.$

Thus it is reasonable to look for $Q(X, x, w)$ in the form $Q = B(X, x, w, M(w))$, where
$B$ has a large derivative with respect to $M$. Then we can use the following formula
for the Hessian of the composition:

(6.10)  $d^2 Q = d^2 B + \frac{\partial B}{\partial M}\,d^2 M.$



7. Necessary and sufficient conditions. The proof of the bilinear weighted imbedding theorem

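We try to repeat the argument of the previous section. First of all it would be
nice to discern the variables of the future Bellman function. Some of them are
ready right away. These are $X = \langle f^2\rangle_J$, $x = \langle f w^{1/2}\rangle_J$, $w = \langle w\rangle_J$, $Y = \langle g^2\rangle_J$,
$y = \langle g v^{1/2}\rangle_J$, $v = \langle v\rangle_J$. Imitating the considerations of the previous section we also fix

$M = \frac{1}{|J|}\int_J \left(\sum_{I\subset J} \frac{1}{|I|}\chi_I \langle w\rangle_I \alpha_I\right)^2 v\,dx$  and  $N = \frac{1}{|J|}\int_J \left(\sum_{I\subset J} \frac{1}{|I|}\chi_I \langle v\rangle_I \alpha_I\right)^2 w\,dx.$

By doing this we are making the following function concave:

$\Phi(X, x, w, Y, y, v, M, N) \stackrel{\mathrm{def}}{=} \sup \frac{1}{|J|}\sum_{I\subset J} \langle f w^{1/2}\rangle_I \langle g v^{1/2}\rangle_I \alpha_I,$

where the supremum is taken over nonnegative functions with "main" averages fixed
as above and such that

$\forall I \subset J, \quad \frac{1}{|I|}\int_I \left(\sum_{I'\subset I} \frac{1}{|I'|}\chi_{I'} \langle w\rangle_{I'} \alpha_{I'}\right)^2 v\,dx \le C\,\langle w\rangle_I,$

$\forall I \subset J, \quad \frac{1}{|I|}\int_I \left(\sum_{I'\subset I} \frac{1}{|I'|}\chi_{I'} \langle v\rangle_{I'} \alpha_{I'}\right)^2 w\,dx \le C\,\langle v\rangle_I.$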

We could try to use $\Phi$ as the Bellman function of our problem, hoping that it
gives our main inequality (5.1). Actually it does not. It gives a certain inequality,
but it is weaker than the one we need. On the other hand, we have used all possible
variables (averages, conditions). There are no more averages involved, and there are
no more conditions (if we believe that our bilinear imbedding theorem is true).

First let us find the way out of this impasse by heuristic considerations. Notice that in
the previous sections, once the Bellman function is found, we apply a certain inequality
not to the Bellman function itself but rather to its composition with martingales. In
fact, all these $X = \langle f^2\rangle_J, \dots, v = \langle v\rangle_J$ are martingales. There are also supermartingales.
For example,

$M_J = \frac{1}{|J|}\int_J \left(\sum_{I\subset J} \frac{1}{|I|}\chi_I \langle w\rangle_I \alpha_I\right)^2 v\,dx$

is a supermartingale. Let us pay attention to its discrete Laplacian $M_J - \frac{M_{J_-} + M_{J_+}}{2}$.

If we consider the supermartingale $m_J = \frac{1}{|J|}\sum_{I\subset J} \langle w\rangle_I^2 \alpha_I$ from the previous section, we
see that its "discrete Laplacian" $m_J - \frac{m_{J_-} + m_{J_+}}{2}$ is equal to $\frac{1}{|J|}\langle w\rangle_J^2 \alpha_J$. We notice that
it involves only the martingale $\langle w\rangle_J$, which has already been chosen as a variable (called
$w$) of the Bellman function.

What happens with the "discrete Laplacian" of the supermartingale $M_J$? When we calculate
it, we get

(7.1)  $M_J - \frac{M_{J_-} + M_{J_+}}{2} = \frac{1}{|J|}\,\langle w\rangle_J\,K_J\,\alpha_J,$

where

$K_J = \frac{1}{|J|}\sum_{I\subset J} \langle w\rangle_I \langle v\rangle_I \alpha_I.$

So the "second derivative" of the supermartingale $M_J$ involves not only the martingale
$\langle w\rangle_J$ but also a new supermartingale $K_J$.

The supermartingale $K_J$, which appeared in the "discrete Laplacian" of our "natural
variable" supermartingale $M$, should also be embodied by a variable; naturally, let us
call it $K$.

One may now try to continue this process by taking the "second derivative" of $K_J$:

(7.2)  $K_J - \frac{K_{J_-} + K_{J_+}}{2} = \frac{1}{|J|}\,\langle w\rangle_J\,\langle v\rangle_J\,\alpha_J,$

to make sure that it involves only the already considered martingales $\langle w\rangle_J$, $\langle v\rangle_J$. Fortunately
no new variables appear after $K$.
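The identity for the "discrete Laplacian" of $K_J$ is easy to confirm on a finite dyadic tree. The sketch below assumes $K_J = \frac{1}{|J|}\sum_{I\subset J}\langle w\rangle_I\langle v\rangle_I\alpha_I$ with the dyadic lattice truncated at depth `n`, weights constant on the finest cells, and arbitrary nonnegative coefficients `alpha`; the data layout (pairs `(level, index)` with $|J| = 2^{-\text{level}}$) and all names are illustrative.

```python
import random

def build_tree(n, seed=0):
    """Random weights on 2**n leaves and random alpha_I >= 0 on every dyadic interval."""
    rng = random.Random(seed)
    N = 2 ** n
    w = [rng.uniform(0.1, 2.0) for _ in range(N)]
    v = [rng.uniform(0.1, 2.0) for _ in range(N)]
    alpha = {(lvl, i): rng.random() for lvl in range(n + 1) for i in range(2 ** lvl)}
    return w, v, alpha

def average(values, lvl, i, n):
    """Average of the leaf values over the dyadic interval (lvl, i)."""
    length = 2 ** (n - lvl)
    return sum(values[i * length:(i + 1) * length]) / length

def K(w, v, alpha, lvl, i, n):
    """K_J = (1/|J|) * sum over dyadic I inside J of <w>_I <v>_I alpha_I."""
    total = 0.0
    for sub in range(lvl, n + 1):
        count = 2 ** (sub - lvl)
        for j in range(i * count, (i + 1) * count):
            total += average(w, sub, j, n) * average(v, sub, j, n) * alpha[(sub, j)]
    return total * 2 ** lvl      # division by |J| = 2**(-lvl)

def laplacian_K(w, v, alpha, lvl, i, n):
    """K_J - (K_{J-} + K_{J+}) / 2 for a non-leaf interval J = (lvl, i)."""
    left = K(w, v, alpha, lvl + 1, 2 * i, n)
    right = K(w, v, alpha, lvl + 1, 2 * i + 1, n)
    return K(w, v, alpha, lvl, i, n) - 0.5 * (left + right)

n = 4
w, v, alpha = build_tree(n)
top = laplacian_K(w, v, alpha, 0, 0, n)
print(top, average(w, 0, 0, n) * average(v, 0, 0, n) * alpha[(0, 0)])
```

The two printed numbers coincide up to rounding: the sums over $J_-$ and $J_+$ cancel exactly, leaving only the term with $I = J$.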

The recipe. Let us repeat that we are calculating the "discrete Laplacian" of
the composition of a certain (concave) function (the Bellman function) and several
martingales and supermartingales. It is only too natural that in this calculation the
"discrete Laplacians" of all these martingales and supermartingales appear. For
martingales they will be zero; for supermartingales they are positive and involve our
martingales and maybe some new (super)martingales. If this happens, these new
(super)martingales should be added to the list of (super)martingales which must
become the variables.

In the present situation we have another supermartingale $N_J$ to which we
have to apply this algorithm. It is given by the formula:

$N_J = \frac{1}{|J|}\int_J \left(\sum_{I\subset J} \frac{1}{|I|}\chi_I \langle v\rangle_I \alpha_I\right)^2 w\,dx.$

And so its "discrete Laplacian" can be calculated as follows:

(7.3)  $N_J - \frac{N_{J_-} + N_{J_+}}{2} = \frac{1}{|J|}\,\langle v\rangle_J\,K_J\,\alpha_J,$

where $K_J$ is the same supermartingale we found by calculating the "discrete Laplacian"
of $M_J$ above.

Now we have all the variables: $X, x, w, Y, y, v, K, M, N$. Let us introduce notation
for this 9-tuple and also for $X, x, w, Y, y, v, M, N$ and $X, x, w, Y, y, v, K$:

$a = (X, x, w, Y, y, v, K, M, N), \quad b = (X, x, w, Y, y, v, M, N), \quad c = (X, x, w, Y, y, v, K).$

For technical reasons we will be looking for $B$ in the form $B(a) = Q(b) + P(c)$.
The domain of definition is

$D = D_B = \{a = (X, x, w, Y, y, v, K, M, N) : x^2 \le Xw, \ y^2 \le Yv, \ M \le Cw, \ N \le Cv, \ K \le C\sqrt{wv}, \ a \ge 0\}.$

The only thing requiring clarification is why $K \le C\sqrt{wv}$ here. But $K$ stands
for the supermartingale $K_J$, for which this inequality was proved in Lemma 3.3. So this
inequality represents a natural restriction on $K$.

What inequalities on $B$ would be sufficient for us? It should be the inequality
on the composition of $B$ with all our (super)martingales. So consider
$a_J = (\langle f^2\rangle_J, \langle f w^{1/2}\rangle_J, w_J, \langle g^2\rangle_J, \langle g v^{1/2}\rangle_J, v_J, K_J, M_J, N_J)$, with $a_{J_\pm}$ denoting the same thing
for $J_\pm$ instead of $J$. Here are the inequalities we want to obtain:

(7.4)  $B(a_J) - \frac{B(a_{J_-}) + B(a_{J_+})}{2} \ge c\,\frac{1}{|J|}\,\langle f w^{1/2}\rangle_J \langle g v^{1/2}\rangle_J \alpha_J,$






F. NAZAROV, S. TREIL, A. VOLBERG 



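(7.5)  $0 \le B(a) \le C(X + Y).$

If we have both of these inequalities then, moving from $J$ to $J_\pm$ and then continuing
this process to the "sons" of $J_\pm$, et cetera, we obtain the inequality

(7.6)  $\frac{1}{|J|}\sum_{I\subset J} \langle f w^{1/2}\rangle_I \langle g v^{1/2}\rangle_I \alpha_I \le C\left(\langle f^2\rangle_J + \langle g^2\rangle_J\right).$

The use of the homogeneity of the left part (it does not change under the replacement
of $f$ by $tf$, $g$ by $t^{-1}g$) would finish the proof of our bilinear weighted imbedding
theorem because we obtain

$\frac{1}{|J|}\sum_{I\subset J} \langle f w^{1/2}\rangle_I \langle g v^{1/2}\rangle_I \alpha_I \le C\sqrt{\langle f^2\rangle_J \langle g^2\rangle_J} = C\,\|f\|_{L^2}\,\|g\|_{L^2}.$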
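The homogeneity step replaces the sum $X + Y$ on the right by a geometric mean: applying the additive bound to $tf$ and $t^{-1}g$ gives $C(t^2X + t^{-2}Y)$, and the best choice of $t$ yields $2C\sqrt{XY}$. A minimal numeric sketch of this optimization follows; the function name `best_scaling` is ours.

```python
def best_scaling(X, Y):
    """Minimise t**2 * X + Y / t**2 over t > 0; returns (t_opt, minimum)."""
    t_opt = (Y / X) ** 0.25
    return t_opt, t_opt ** 2 * X + Y / t_opt ** 2

X, Y = 4.0, 9.0
t_opt, minimum = best_scaling(X, Y)
print(t_opt, minimum, 2 * (X * Y) ** 0.5)
```

The last two printed values agree up to rounding, and any other $t > 0$ gives a value that is at least as large by the arithmetic-geometric mean inequality.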



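Lemma 7.1. To have (7.4) it is sufficient to have the following inequalities:

(7.7)  $B(a) - \frac{B(a_-) + B(a_+)}{2} \ge c_1\,\frac{xy}{wv}\left(K - \frac{K_- + K_+}{2}\right)$

if

(7.8)  $\frac{x^2}{w}K + \frac{y^2}{v}K \le Cxy,$

and

(7.9)  $B(a) - \frac{B(a_-) + B(a_+)}{2} \ge c_2\,\frac{x^2}{w^2}\left(M - \frac{M_- + M_+}{2}\right) + c_3\,\frac{y^2}{v^2}\left(N - \frac{N_- + N_+}{2}\right)$

if

(7.10)  $\frac{x^2}{w}K + \frac{y^2}{v}K \ge Cxy.$

Here $a \ge \frac{a_- + a_+}{2}$, and $c_i$, $i = 1, 2, 3$, are positive constants.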
Proof. Let us calculate the right parts of (7.7) and (7.9) for

$a = (X, x, w, Y, y, v, K, M, N) = (\langle f^2\rangle_J, \langle f w^{1/2}\rangle_J, w_J, \langle g^2\rangle_J, \langle g v^{1/2}\rangle_J, v_J, K_J, M_J, N_J),$

and $a_\pm$ having the same meaning with $J$ replaced by $J_\pm$. We do this calculation by
using (7.1)-(7.3). In both cases, the right part is bigger than

$\frac{c}{|J|}\,xy\,\alpha_J = \frac{c}{|J|}\,\langle f w^{1/2}\rangle_J \langle g v^{1/2}\rangle_J \alpha_J.$

Notice that we use (7.10) when we calculate the right part of (7.9). At any rate, in
both cases we get (7.4).

Now we are left to find $B$ satisfying (7.7) and (7.9). As we mentioned, we will be
looking for $B$ in the form $B(a) = Q(b) + P(c)$, where $a = (X, x, w, Y, y, v, K, M, N)$,
$b = (X, x, w, Y, y, v, M, N)$, $c = (X, x, w, Y, y, v, K)$.

2 2 

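Lemma 7.2. The function $Q = CX + CY - \frac{x^2}{w + M} - \frac{y^2}{v + N}$ satisfies the inequality

(7.11)  $Q(b) - \frac{Q(b_-) + Q(b_+)}{2} \ge c\,\frac{x^2}{w^2}\left(M - \frac{M_- + M_+}{2}\right) + c\,\frac{y^2}{v^2}\left(N - \frac{N_- + N_+}{2}\right)$

if $b \in D_Q = \{b = (X, x, w, Y, y, v, M, N) : x^2 \le Xw, \ y^2 \le Yv, \ M \le Cw, \ N \le Cv, \ b \ge 0\}$.

Proof. This is a direct computation. It easily follows from the following infinitesimal
estimates for $Q$:

$\frac{\partial Q}{\partial M} \ge c\,\frac{x^2}{w^2}, \qquad \frac{\partial Q}{\partial N} \ge c\,\frac{y^2}{v^2}, \qquad d^2 Q \le 0.$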

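Lemma 7.3. The function

$P = CX + CY - \sup_{0 < s < \infty}\left(\frac{x^2}{w + sK} + \frac{y^2}{v + s^{-1}K}\right)$

satisfies the inequality

(7.12)  $P(c) - \frac{P(c_-) + P(c_+)}{2} \ge c_1\,\frac{xy}{wv}\left(K - \frac{K_- + K_+}{2}\right)$  if  $\frac{x^2}{w}K + \frac{y^2}{v}K \le Cxy,$

if $c \in D_P = \{c = (X, x, w, Y, y, v, K) : x^2 \le Xw, \ y^2 \le Yv, \ K \le C\sqrt{wv}, \ c \ge 0\}$.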

Proof. We hope that the formula for this function is as surprising for the
reader as it was for us. But once the formula is written, the rest is a direct computation.
It easily follows from the following infinitesimal estimates for $P$.

The optimal $s$ can be found from the equation

$sK\,\frac{x^2}{(w + sK)^2} = s^{-1}K\,\frac{y^2}{(v + s^{-1}K)^2}.$

Now we can make the direct computation of $\frac{\partial P}{\partial K}$. If $sK \le Cw$ and $s^{-1}K \le Cv$,
then

$\frac{\partial P}{\partial K} \ge c\left(s\,\frac{x^2}{w^2} + s^{-1}\frac{y^2}{v^2}\right) \ge c\,\frac{xy}{wv}.$

It is left to show that if $sK \gg w$ or if $s^{-1}K \gg v$ (the symbol $\gg$ means "much
larger than") then we are automatically in the area where

$\left(\frac{x^2}{w} + \frac{y^2}{v}\right)K \ge cxy.$

In fact, if $s^{-1}K \gg v$ then $sK \ll w$. This is because we are in the domain where
$K \le C\sqrt{wv}$. Then the equation for the optimal $s$ becomes

$sK\,\frac{x^2}{w^2} \asymp \frac{y^2}{s^{-1}K}.$

Thus $y \asymp \frac{K}{w}x$ and $\frac{x^2}{w}K \ge cxy$. Similarly, if $sK \gg w$ and thus
$s^{-1}K \ll v$, the equation for the optimal $s$ gives $x \asymp \frac{K}{v}y$ and so $\frac{y^2}{v}K \ge cxy$. Lemma 7.3
is completely proved.

So the bilinear weighted imbedding theorem is fully proved.



8. Hilbert transform. Sufficient conditions via Green's potentials

Let $H$ stand for the Hilbert transform on the circle $T$. In other words, the operator
$H$ (defined first on trigonometric polynomials) acts by the formula:

k>0 k<0 

Throughout this section we use the notation $f(z)$ for the Poisson extension of the
function $f \in L^1(T)$ to the disc $D$ evaluated at the point $z \in D$. So, for example,
$f(z)^2$ and $f^2(z)$ are different in general.

The two weights estimates for the Hilbert transform appear naturally in the theory 
of Hankel and Toeplitz operators and in the perturbation theory of linear operators 
(including the perturbation of differential operators). 

For the case of the Hilbert transform there exists the approach of M. Cotlar and
C. Sadosky through Krein's moment theory. Their approach (which can be referred to as the
Generalized Bochner Theorem) provides both integral representation and extension
of forms and kernels invariant under the shift operator. Being applied to a special
bilinear form built with the help of the Hilbert transform and two measures, this
approach gives a necessary and sufficient condition for the Hilbert transform to be
bounded between $L^2$-spaces with respect to these measures (see [CS1]). The approach
of M. Cotlar and C. Sadosky is very interesting because it provides a direct link
between the lifting theory of Sz.-Nagy and Foias (and thus the scattering theory) and
the continuity of the Hilbert transform in weighted spaces (see [S]).

In the case of equal measures the Cotlar-Sadosky theorem is a direct analog of the
Helson-Szego characterization of $A_2$ weights.

On the other hand, Cotlar-Sadosky theory leaves many questions open. It does not provide
any analog of the Hunt-Muckenhoupt-Wheeden characterization of $A_2$ weights.
And it seems impossible to provide in their framework the treatment of a general
Calderon-Zygmund operator between two weighted spaces.

There is an extensive literature consisting of separately necessary and sufficient 
conditions in the spirit of the Hunt-Muckenhoupt-Wheeden characterization of A 2 
weights. In most cases authors assume some sort of for one (or both) of the 
weights. Another kind of result is represented very well by the following theorem of
Dechao Zheng in [Z]: $H$ is bounded from $L^2(w^{-1})$ to $L^2(v)$ if for a positive number $\eta$

$\sup_{z\in D} v^{1+\eta}(z)\,w^{1+\eta}(z) < \infty.$

Sarason asked a question essentially equivalent to the question whether one can get
rid of $\eta$. The counterexample of Nazarov [N] shows that $\eta$ cannot be made 0. On
the other hand, in [TVZ] it is shown that one can weaken the above assumption of
Dechao Zheng if one uses certain Orlicz norms instead of $L^{1+\eta}$.

Here we add another list of sufficient conditions to the existing collection of such
conditions. We do not know whether our list represents necessary conditions. Most
probably it does not. However, our conditions are completely different from those found
before. We found them by means of the same Bellman functions we have used
above to solve the two weight problem for the Haar multipliers (which we consider
as a "discrete analog" of the two weight problem for $H$).

Let us introduce further notation. Let $P_z(t) = \frac{1 - |z|^2}{|1 - \bar z t|^2}$ be the Poisson kernel with
pole at $z \in D$. Let $G(z, \zeta) = \log\left|\frac{1 - z\bar\zeta}{z - \zeta}\right|$ be Green's function with pole at $z$. If $\mu$
is a measure on $D$ then Green's potential of this measure is given by $G(\mu)(z) \stackrel{\mathrm{def}}{=} \int_D G(z, \zeta)\,d\mu(\zeta)$.
It is positive if $\mu$ is positive, and $-\Delta(G(\mu)) = \mu$ in the sense of
distributions (up to an absolute constant factor). We always abbreviate $G(f\,dxdy)(z)$ to $G(f)(z)$.

If $u(z)$ is a harmonic function then $u'(z)$ denotes the holomorphic function

Lemma 8.1. Let $f = (f_1, \dots, f_k)$ be a $k$-tuple of real valued $L^1(T)$-functions, $f(z)$
be the corresponding $k$-tuple of harmonic functions, $f'(z)$ be the corresponding $k$-tuple of
analytic functions. Let $B$ be a function of $k$ real variables, and let $d^2 B$ denote its
Hessian. Put $b(z) = B(f(z))$. Then $\Delta b(z) = 4\left(d^2 B(f(z))f'(z), f'(z)\right)$, where $(\cdot, \cdot)$
means the scalar product in $C^k$.

Proof. This is a direct computation using the harmonicity of $f(z)$. See
[NT].
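The Laplacian formula of Lemma 8.1 can be tested numerically on a concrete example. The sketch below makes several assumptions that are ours: $u'$ is read as the Wirtinger derivative $\partial u/\partial z = \frac{1}{2}(u_x - iu_y)$, the formula is taken in the form $\Delta b = 4(d^2B\,f', f')$, the harmonic functions are $f_1 = \mathrm{Re}\,z^2$ and $f_2 = \mathrm{Re}\,z^3$, and the test function is $B(u_1, u_2) = u_1^2u_2 + u_2^2$. The Laplacian is approximated by a five-point stencil.

```python
def f1(z):
    return (z * z).real              # harmonic: Re z^2

def f2(z):
    return (z ** 3).real             # harmonic: Re z^3

def df1(z):
    return z                         # d/dz of Re z^2

def df2(z):
    return 1.5 * z * z               # d/dz of Re z^3

def B(u1, u2):
    return u1 * u1 * u2 + u2 * u2    # arbitrary smooth test function

def hessian_B(u1, u2):
    return [[2.0 * u2, 2.0 * u1], [2.0 * u1, 2.0]]

def rhs(z):
    """4 * (d^2 B(f(z)) f'(z), f'(z)) with the Hermitian product of C^2."""
    H = hessian_B(f1(z), f2(z))
    d = [df1(z), df2(z)]
    s = sum(H[i][j] * d[j] * d[i].conjugate() for i in range(2) for j in range(2))
    return 4.0 * s.real

def laplacian(z, h=1e-3):
    """Five-point finite-difference Laplacian of b(z) = B(f1(z), f2(z))."""
    b = lambda p: B(f1(p), f2(p))
    return (b(z + h) + b(z - h) + b(z + 1j * h) + b(z - 1j * h) - 4.0 * b(z)) / (h * h)

z0 = 0.3 + 0.4j
print(laplacian(z0), rhs(z0))
```

The two printed values agree up to the discretization error of the stencil. The first-order terms $\sum_i \partial_i B\,\Delta f_i$ vanish because each $f_i$ is harmonic, which is exactly where harmonicity enters.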

Our last notation concerns an operator with positive kernel, which is related to $H$
the same way as $T_0$ was related to $T_\varepsilon$. We first write the bilinear form of this $H_0$.
Let $f$, $g$ be from $L^2(T)$, and let $(\cdot, \cdot)$ now mean the scalar product in $L^2(T)$. Then
we define $H_0$ as follows:

$(H_0 f, g) = \int_D f(z)\,g(z)\,\left|\frac{w'(z)}{w(z)}\right|\,\left|\frac{v'(z)}{v(z)}\right|\,(1 - |z|)\,dxdy.$

One can easily see that $H_0$ is the operator with positive kernel ($s, t \in T$)

$h(s,t) = \int_D P_z(t)\,P_z(s)\,\left|\frac{w'(z)}{w(z)}\right|\,\left|\frac{v'(z)}{v(z)}\right|\,(1 - |z|)\,dxdy.$



Now we are ready to formulate and to prove our result about $H$.

Theorem 8.2. The following conditions together are sufficient for the boundedness
of $H$ from $L^2(w^{-1})$ to $L^2(v)$:

(8.1)  $\sup_{z\in D} w(z)\,v(z) < \infty$;

(8.2)  $G(|v'|^2 w)(z) \le Cv(z)$;

(8.3)  $G(|w'|^2 v)(z) \le Cw(z)$;

(8.4)  $\|H_0\|_{L^2(w^{-1})\to L^2(v)} < \infty$.



Remark. Thus, the problem for a singular integral operator is reduced to the problem
for an operator with positive kernel. In the discrete case considered in Sections
1-7 the corresponding kernel $k$ did not satisfy the regularity conditions usually imposed
on the kernels to apply Sawyer's theory (see [SW]). However, we found (see
Theorem 0.3) necessary and sufficient conditions for the operator $T$ with kernel $k$ to
be bounded. Now we can do the same. Unfortunately, the kernel $h$ of $H_0$ does not in general satisfy
any regularity conditions. Still we can imitate Theorem 0.3 and prove



Theorem 8.3. Put $K(z) = G(|w'v'|)(z)$, and put $M(z) = G\left(Kw\,\frac{|w'v'|}{wv}\right)(z)$, $N(z) = G\left(Kv\,\frac{|w'v'|}{wv}\right)(z)$.
Then the following conditions together are sufficient for the boundedness
of $H$ from $L^2(w^{-1})$ to $L^2(v)$:

(8.5)  $M(z) \le Cw(z)$;

(8.6)  $N(z) \le Cv(z)$.



We leave Theorem 8.3 to the reader. Now we prove Theorem 8.2.

Proof of Theorem 8.2. Let $f$, $g$ be continuous real valued functions on $T$. First
of all we can assume $g(0) = 0$. We may also assume that $w$, $v$ are continuous because
the estimates we are going to get are independent of these assumptions. Clearly,

$\left|\int_T Hf \cdot g\,dm\right| \le C\int_D |f'(z)|\,|g'(z)|\,(1 - |z|)\,dxdy.$



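Notice that, imitating the sums from Section 2, we can write

$\int_D |f'(z)|\,|g'(z)|\,(1 - |z|)\,dxdy \le \int_D |f(z)|\,|g(z)|\,\left|\frac{f'(z)}{f(z)} - \frac{w'(z)}{w(z)}\right|\,\left|\frac{g'(z)}{g(z)} - \frac{v'(z)}{v(z)}\right|\,(1 - |z|)\,dxdy$

$+ \int_D |f(z)|\,|g(z)|\,\left|\frac{w'(z)}{w(z)}\right|\,\left|\frac{g'(z)}{g(z)} - \frac{v'(z)}{v(z)}\right|\,(1 - |z|)\,dxdy$

$+ \int_D |f(z)|\,|g(z)|\,\left|\frac{f'(z)}{f(z)} - \frac{w'(z)}{w(z)}\right|\,\left|\frac{v'(z)}{v(z)}\right|\,(1 - |z|)\,dxdy$

$+ \int_D |f(z)|\,|g(z)|\,\left|\frac{w'(z)}{w(z)}\right|\,\left|\frac{v'(z)}{v(z)}\right|\,(1 - |z|)\,dxdy$

$= \Sigma_1 + \Sigma_2 + \Sigma_3 + \Sigma_4.$

Now let us think that $f = \varphi w^{1/2}$, $g = \psi v^{1/2}$. We only need to prove that $\Sigma_i$, $i = 1, 2, 3$,
are estimated by $C(\|\varphi\|_2^2 + \|\psi\|_2^2)$. In fact, replacing $f$ by $tf$ and $g$ by $t^{-1}g$ then allows
us to estimate these sums by $C\,\|\varphi\|_2\,\|\psi\|_2$.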

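Let us first do it for $\Sigma_1$. Consider $B(X, x, w, Y, y, v) = X - \frac{x^2}{w} + Y - \frac{y^2}{v}$. Put
$b(z) = B\left(\varphi^2(z), (\varphi w^{1/2})(z), w(z), \psi^2(z), (\psi v^{1/2})(z), v(z)\right)$. Using Lemma 8.1 we can compute
its Laplacian and see that it is superharmonic and moreover that (with $f(z) = (\varphi w^{1/2})(z)$,
$g(z) = (\psi v^{1/2})(z)$)

$-\Delta(b(z)) \ge c\,\frac{f(z)^2}{w(z)}\left|\frac{f'(z)}{f(z)} - \frac{w'(z)}{w(z)}\right|^2 + c\,\frac{g(z)^2}{v(z)}\left|\frac{g'(z)}{g(z)} - \frac{v'(z)}{v(z)}\right|^2.$

Thus,

$-\Delta(b(z)) \ge c\,\frac{|f(z)\,g(z)|}{\sqrt{w(z)\,v(z)}}\left|\frac{f'(z)}{f(z)} - \frac{w'(z)}{w(z)}\right|\,\left|\frac{g'(z)}{g(z)} - \frac{v'(z)}{v(z)}\right|.$

And so we obtain, using (8.1):

(8.7)  $-\Delta(b(z)) \ge c\,|f(z)\,g(z)|\,\left|\frac{f'(z)}{f(z)} - \frac{w'(z)}{w(z)}\right|\,\left|\frac{g'(z)}{g(z)} - \frac{v'(z)}{v(z)}\right|.$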



Notice also that the function $b$ vanishes on the circle $T$. In fact, $\varphi^2(z) = \frac{\left((\varphi w^{1/2})(z)\right)^2}{w(z)}$
and $\psi^2(z) = \frac{\left((\psi v^{1/2})(z)\right)^2}{v(z)}$ on $T$. Notice also that $b(0) \le \|\varphi\|_2^2 + \|\psi\|_2^2$.

Applying Green's formula, we get

(8.8)  $-\int_D \Delta(b(z))\,\log\frac{1}{|z|}\,dxdy = b(0) - \int_T b\,dm.$

Combining (8.7) and (8.8), we obtain that

$\Sigma_1 \le c\left(\|\varphi\|_2^2 + \|\psi\|_2^2\right).$

The estimates of $\Sigma_2$ and $\Sigma_3$ are similar to each other.

Let us estimate $\Sigma_2$. To do that we need the following function $p(z)$ (recall that $f = \varphi w^{1/2}$):



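$p(z) = \varphi^2(z) - \frac{f(z)^2}{w(z) + G(|w'|^2 v)(z)} + \psi^2(z) - \frac{g(z)^2}{v(z)}.$

Notice how small the difference with $b$ is. But this small difference allows the following
estimate of the Laplacian (we use here Lemma 8.1 again and also formula (6.10)):

$-\Delta(p(z)) \ge c\,\frac{f(z)^2}{\left(w(z) + G(|w'|^2 v)(z)\right)^2}\left(-\Delta\left(w(z) + G(|w'|^2 v)(z)\right)\right) + c\,\frac{g(z)^2}{v(z)}\left|\frac{g'(z)}{g(z)} - \frac{v'(z)}{v(z)}\right|^2.$

Thus,

$-\Delta(p(z)) \ge c\,\frac{|w'(z)|^2}{w(z)^2}\,f(z)^2\,v(z) + c\,\frac{g(z)^2}{v(z)}\left|\frac{g'(z)}{g(z)} - \frac{v'(z)}{v(z)}\right|^2.$

We used here the assumption $G(|w'|^2 v)(z) \le Cw(z)$ of Theorem 8.2.
Thus,

$-\Delta(p(z)) \ge c\,\frac{|w'(z)|}{w(z)}\,|f(z)\,g(z)|\,\left|\frac{g'(z)}{g(z)} - \frac{v'(z)}{v(z)}\right|.$

Again $p$ vanishes on $T$ and again obviously $p(0) \le \|\varphi\|_2^2 + \|\psi\|_2^2$.
Thus, using Green's formula as before, we get

$\int_D \frac{|w'(z)|}{w(z)}\,|f(z)\,g(z)|\,\left|\frac{g'(z)}{g(z)} - \frac{v'(z)}{v(z)}\right|\,\log\frac{1}{|z|}\,dxdy \le c\left(\|\varphi\|_2^2 + \|\psi\|_2^2\right),$

which is the estimate of $\Sigma_2$. Similarly we treat $\Sigma_3$ using the assumption
$G(|v'|^2 w)(z) \le Cv(z)$ of Theorem 8.2. Theorem 8.2 is completely proved.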

9. Two weight norm inequalities for S-functions. Necessary and sufficient conditions

Square functions play an important role in the theory of singular integral operators.
Often the estimate of a singular integral operator goes through the estimate of a
certain $S$-function. A multitude of examples can be found in [St]. In one of the
most recent examples, in [V], this approach was applied to characterize the matrix $A_p$
weights.

Let us consider

$S(f)(x)^2 = \sum_{I :\, x \in I} |\langle f\rangle_{I_-} - \langle f\rangle_{I_+}|^2.$

So when is $f \to S(f)$ bounded from $L^2(w^{-1})$ to $L^2(v)$? The following theorem
gives the answer. It says basically that this is so if and only if there is a uniform
bound on the test functions $f = w\chi_J$.



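Theorem 9.1. $\|S(f)\|_{L^2(v)} \le C\,\|f\|_{L^2(w^{-1})}$ if and only if (1.1) and (1.2) hold.

Proof. Necessity is simple. To prove the sufficiency we again use the Bellman
function approach. Let us consider

$B(X, x, w, M) = CX - \frac{x^2}{w} - \frac{x^2}{w + M}.$

We are going to compose it with the following (super)martingales:

$X_I = \langle\varphi^2\rangle_I, \quad x_I = \langle\varphi w^{1/2}\rangle_I, \quad w_I = \langle w\rangle_I, \quad M_I = \frac{1}{|I|}\sum_{\ell\subset I} |\langle w\rangle_{\ell_-} - \langle w\rangle_{\ell_+}|^2\,\langle v\rangle_\ell\,|\ell|.$

By (1.2), $M_I \le Cw_I$. Then we have the supermartingale

$b(I) = B(X_I, x_I, w_I, M_I).$

Its "discrete Laplacian" $\Delta(b)(I) = \frac{b(I_-) + b(I_+)}{2} - b(I)$ can be estimated using Lemma
9.2 below (we use the notation $f = \varphi w^{1/2}$):

$-\Delta(b)(I) \ge c\,\frac{\langle f\rangle_I^2}{\langle w\rangle_I}\,\inf_{c_1, c_2 \in [1/2, 2]}\left|c_1\frac{\langle f\rangle_{I_-} - \langle f\rangle_{I_+}}{\langle f\rangle_I} - c_2\frac{\langle w\rangle_{I_-} - \langle w\rangle_{I_+}}{\langle w\rangle_I}\right|^2 + c\,\frac{\langle f\rangle_I^2}{\langle w\rangle_I^2}\,|\langle w\rangle_{I_-} - \langle w\rangle_{I_+}|^2\,\langle v\rangle_I.$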

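We want to continue the estimate of the negative Laplacian from below. To this end
let us consider two cases.

First case: $\frac{|\langle w\rangle_{I_-} - \langle w\rangle_{I_+}|}{\langle w\rangle_I} \le \frac{1}{10}\,\frac{|\langle f\rangle_{I_-} - \langle f\rangle_{I_+}|}{\langle f\rangle_I}$. In this case we use the first term in
the estimate of our negative Laplacian to see that

$-\Delta(b)(I) \ge 0.09\,c\,\frac{\langle f\rangle_I^2}{\langle w\rangle_I}\,\frac{|\langle f\rangle_{I_-} - \langle f\rangle_{I_+}|^2}{\langle f\rangle_I^2}.$

And so,

(9.1)  $-\Delta(b)(I) \ge c\,\frac{|\langle f\rangle_{I_-} - \langle f\rangle_{I_+}|^2}{\langle w\rangle_I} \ge c\,|\langle f\rangle_{I_-} - \langle f\rangle_{I_+}|^2\,\langle v\rangle_I.$

In the last inequality we use (1.1).

Second case: $\frac{|\langle w\rangle_{I_-} - \langle w\rangle_{I_+}|}{\langle w\rangle_I} \ge \frac{1}{10}\,\frac{|\langle f\rangle_{I_-} - \langle f\rangle_{I_+}|}{\langle f\rangle_I}$. In this case we use the second term
in the estimate of the negative Laplacian. Taking into account the inequality above
we get

(9.2)  $-\Delta(b)(I) \ge 0.01\,c\,|\langle f\rangle_{I_-} - \langle f\rangle_{I_+}|^2\,\langle v\rangle_I.$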
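The numerical constant 0.09 in the first case comes from the worst choice $c_1 = 1/2$, $c_2 = 2$: if $|B| \le |A|/10$ then $|c_1A - c_2B| \ge (1/2 - 2/10)|A| = 0.3|A|$. The sketch below approximates the infimum over the box $[1/2, 2]^2$ on a grid; here `A` and `Bv` stand for the relative jumps of $f$ and of $w$, and the function name and grid size are illustrative.

```python
def inf_over_box(A, Bv, steps=60):
    """Grid approximation of inf over c1, c2 in [1/2, 2] of (c1*A - c2*Bv)**2."""
    grid = [0.5 + 1.5 * k / steps for k in range(steps + 1)]
    return min((c1 * A - c2 * Bv) ** 2 for c1 in grid for c2 in grid)

A = 1.0
for Bv in (0.1, 0.05, 0.0, -0.1):
    value = inf_over_box(A, Bv)
    assert value >= 0.09 * A * A - 1e-12   # first-case bound
    print(Bv, value)
```

The bound is attained at the corner of the box when `Bv` equals `A / 10`, so the constant 0.09 cannot be improved within this argument.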

So we get the same estimate as in the first case. Now we can just apply Green's
formula for our discrete Laplacian, which amounts to just applying (9.1), (9.2) to $J$,
then to $J_\pm$, then to the "sons" of $J_\pm$, et cetera, to obtain

$\frac{1}{|J|}\sum_{I\subset J} |\langle f\rangle_{I_-} - \langle f\rangle_{I_+}|^2\,\langle v\rangle_I\,|I| \le C\,B(X_J, x_J, w_J, M_J) \le C\,\langle\varphi^2\rangle_J.$

But $\langle\varphi^2\rangle_J = \|f\|_{L^2(w^{-1})}^2$, and Theorem 9.1 is completely proved.



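We are left to prove the following lemma. Let us denote $a = (X, x, w)$, and let

$P(a) \stackrel{\mathrm{def}}{=} X - \frac{x^2}{w}, \qquad Q(a, M) \stackrel{\mathrm{def}}{=} X - \frac{x^2}{w + M}.$

Lemma 9.2. Let $a = \frac{a_- + a_+}{2}$, $M \ge \frac{M_- + M_+}{2}$. Then

$-d^2 P = 2\,\frac{x^2}{w}\left|\frac{dx}{x} - \frac{dw}{w}\right|^2;$

$Q(a, M) - \frac{Q(a_-, M_-) + Q(a_+, M_+)}{2} \ge c\,\frac{x^2}{w^2}\left(M - \frac{M_- + M_+}{2}\right);$

$P(a) - \frac{P(a_-) + P(a_+)}{2} \ge c\,\frac{x^2}{w}\,\inf_{c_1, c_2 \in [1/2, 2]}\left|c_1\frac{x_- - x_+}{x} - c_2\frac{w_- - w_+}{w}\right|^2.$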



Proof. The first equality is obtained by direct computation. It shows that $P$ is
concave. Let $a(t)$ be the linear function on $[-1, 1]$ assuming the values $a_-$, $a_+$ at the endpoints
(and thus $a(0) = a$). Let $M(t)$ be the piecewise linear function on $[-1, 1]$ assuming the
values $M_-$, $M$, $M_+$ at $-1$, $0$, $1$ correspondingly. Then $M(t)$ is concave, and
$M''(t) = -2\left(M - \frac{M_- + M_+}{2}\right)\delta_0$, where $\delta_0$ is a Dirac measure at 0. Notice that $Q$ is a composition
of $P$ and a linear function. So $Q$ is concave. Thus, by (6.10), $-d^2 Q \ge -\frac{\partial Q}{\partial M}\,d^2 M$. In
particular, if $q(t) = Q(a(t), M(t))$ then the measure $q''(t)$ satisfies

$-q''(t) \ge c\,\frac{x(t)^2}{w(t)^2}\left(M - \frac{M_- + M_+}{2}\right)\delta_0.$

Thus

$q(0) - \frac{q(-1) + q(1)}{2} = -\frac{1}{2}\int_{-1}^{1}(1 - |t|)\,q''(t)\,dt \ge c\,\frac{x^2}{w^2}\left(M - \frac{M_- + M_+}{2}\right),$

and the second inequality is proved.
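The passage from a bound on the second derivative to a bound on the midpoint defect rests on the identity $q(0) - \frac{q(-1) + q(1)}{2} = -\frac{1}{2}\int_{-1}^{1}(1 - |t|)\,q''(t)\,dt$, which follows from integrating by parts twice. The sketch below checks it for one smooth concave test function; the choice of `q` and the midpoint quadrature are illustrative assumptions.

```python
def q(t):
    """A smooth concave test function on [-1, 1] (an arbitrary choice)."""
    return -t ** 4 - t ** 2 + 0.3 * t

def q2(t):
    """Second derivative of q."""
    return -12.0 * t ** 2 - 2.0

def weighted_second_derivative(n=4000):
    """Midpoint rule for the integral of (1 - |t|) * q''(t) over [-1, 1]."""
    h = 2.0 / n                  # n is even, so the kink at t = 0 is a cell boundary
    total = 0.0
    for k in range(n):
        t = -1.0 + (k + 0.5) * h
        total += (1.0 - abs(t)) * q2(t) * h
    return total

lhs = q(0.0) - 0.5 * (q(-1.0) + q(1.0))
rhs = -0.5 * weighted_second_derivative()
print(lhs, rhs)
```

Both sides are close to 2 for this test function. The weight $1 - |t|$ is the Green's function of the interval with pole at 0, which is why the argument is a one-dimensional analog of Green's formula.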

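To prove the third inequality of the lemma let us introduce $p(t) = P(a(t))$. Using
the calculation for $-d^2 P$ we obtain that the measure $p''(t)$ satisfies

$-p''(t) = \frac{1}{2}\,\frac{x(t)^2}{w(t)}\left|\frac{x_- - x_+}{x(t)} - \frac{w_- - w_+}{w(t)}\right|^2.$

On $[-1/2, 1/2]$ we have that $\frac{x}{x(t)} \in [1/2, 2]$ and $\frac{w}{w(t)} \in [1/2, 2]$. Thus, on $[-1/2, 1/2]$,
we have

$-p''(t) \ge c\,\frac{x^2}{w}\,\inf_{c_1, c_2 \in [1/2, 2]}\left|c_1\frac{x_- - x_+}{x} - c_2\frac{w_- - w_+}{w}\right|^2.$

We finish the proof by combining this inequality with the following one:

$p(0) - \frac{p(-1) + p(1)}{2} = -\frac{1}{2}\int_{-1}^{1}(1 - |t|)\,p''(t)\,dt \ge -\frac{1}{2}\int_{-1/2}^{1/2}(1 - |t|)\,p''(t)\,dt.$

10. Concluding remarks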

1) It seems that the case $p = 2$ is a true miracle, because in this case we were able to
give a finite list of simple conditions which are necessary and sufficient for the two weight
boundedness of our family $T_\varepsilon$ of Calderon-Zygmund operators. As for the case $p \ne 2$,
there are strong indications that the similar list of conditions (which can actually be
copied from the "$p = 2$" case) will not be equivalent to the two weight estimate.

Let us notice that there exists an approach through Cotlar-Sadosky theory to the two
weight estimate for the Hilbert transform even for $p \ne 2$ (see [CS2]).

2) One wonders whether the 5 conditions in our list in Section 1 are independent.
This is most probably so, but the proof should be quite involved. At least it follows
from [N] that (1.1) implies neither (1.2) nor (1.3).